FIGURE 1 



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGAGATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCT TAT T TGGGGCAAT GC TAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG 

ACAAAGGT TAT GT GC T TCACAT GATCACTAC TGCAGCAGAAT GGT CTAT GT CAT TTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 

T GGAT TAACCC T C TAT GAC ACT GCACCT TGCC CT AT T AACAAT G AACGAAC ACGGCTAC T T TCCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 

T T C AC AG AAG T T G C T TAT TCTTCTCT GAAAT T T T C AAC CAC T T AAT C AAGGC T GACAG T AAC AC T 
GAT G AAT GC T GAT AAT C AG GAAAC AT G AAAG AAG C C AT T T GAT AG AT T A T T C T AAAGG AT AT CAT 
CAAGAAGAC TAT T AAAAAC ACC T ATGCC TAT ACT T T T T TAT C T C AGAAAATAAAG T CAAAAGAC T 
ATG 
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FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX(S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 

LC I AT I YVRYKQVHALS PEENVI I KLNKAGLVLG I LSCLGLS I VANFQKTTLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 

LLSRDI 

Important features: 

Type II transmembrane domain: 



111 amino acids 13-33 



Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 



O N-myristoylation sites. 



Ill amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 

GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 

CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 

GAT TT GGGT GGCT T TTCTTCATGC GCCAATTGTTTAAAGACTATGAGATACGTCAGTAT GTTGTACAG 

GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 

AGTAT TGAATAGCAGCT CCCGTTATTT TCACTGGAAAATGAACCTGTGT GTAATT CTGCTGAT CCT GG 

TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA 

CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 

CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 

TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT 

CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA 

AACCATCAGGT T TCT GGGGAATGATAAAAAGTGTT ACCACT TCAGCAT CAGGAAGT GAAAATCTTACT 

CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT 

ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT 

ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 

GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT 

GAAGT TT T GGTC CCAACACATT TCCTT CATT CTTGT T GGAATAAT CAT CGTCACAT CCATCAGAGGAT 

TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 

CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 

ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 

GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 

GAT ATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATT T TAT AAACAAACAAAATGCTATGGT AGC 

ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 

AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG 

TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 

CTATGGTAGCT GAGCCAAACACGT AGGATTT CCGT TT TAAGGTT CACATGGAAAAGGTTAT AGCTTT G 

CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAATU^AAAAAAAAAGGGCGGCCGCG 

ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
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FIGURE 4 



MS FLIDSSIMITSQILFFG FGW L FFMRQL FKD YE I RQYWQVI FS VT FAFS C TMFE L 1 1 FE I LGV 
LNSSSRYFHWK^LCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRYGVIGVTIJ>IM,LSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ 

TMDMI I SKKKRMAMARRTMFQKGEVHNKP S GFWGMI KS VT T SAS GSENLTL IQQEVDALEELSRQ 
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TVNYLGIQFDVKFWSQHISFILVGI I IVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS SVLL I RMSMPLE YRT 1 1 TEVLGE LQFN FYHRW FDVI FLVSALS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites . 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobalamin-binding proteins 
amino acids 151-160 
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FIGURE 5 



AGCAGGGAAAT CCGGATGT CTCGGT T ATGAAGT GGAGCAGTGAGT GT GAGCC TCAACATAGT TCC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
G TCATGCAGAAT TATGGGGATCACC CT TGT GAGCAAAAAGGCGAACCAGCAGCT GAAT T T CACAG 
AAGCT AAGGAGGC CT GTAGGCTGCT GGGAC TAAGT T TGGCC GGCAAGGACCAAGT T GAAACAGCC 
T TGAAAGCTAGCT TT GAAACTTGCAGCTATGGCTGGGT TGGAGATGGATTCGTGGTCATCTC TAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT TT GCAGCCTATTGT TACAACT CATCTGATACT TGGACTAACTCGTGCAT TCCAGAA 
y : AT TATCACCACCAAAGATCCCATAT TCAACACT CAAAC TGCAACACAAAGAACAGAAT T TAT TG T 

LI CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
C TGC T CCAGCT TCCACT TC TATTCCACGGAGAAAAAAAT TGAT T TG TGTCACAGAAGT T T T TAT G 
O GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA 
If! AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
hi CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
y 1 GATGCC T GGAAGC T GAAGT T TAGAT GAGAC AGAAAT GAGGAGACACACC T GAGGC TGGTTTCTTT 

M CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
s C CC TTGGT T C CTAACTGGAATCAGCTCAGGACT GCCAT TGGACTAT GGAGT GCACCAAAGAGAAT 

f*\ GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
*jj TTCTAGCCTGGCTATGTCC TAATAATATCCCACTGGGAGAAAGGAGTT TT GCAAAGTGCAAGGAC 

|y CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
C3 AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
flj GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT 
fZj GGCAGAAAAGT TT AGCCCC TGAAAGCCAT GGAGAT TCTCATAAC TT GAGACC TAAT CT CTGTAAA 

GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC 
' 11 ACAGACAGGGT CAAAGT GT TT TCT CT GAACACAT T GAGT TGGAATCACTGT TTAGAACACACACA 

CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAAT AAAAAC TC T TATAAATT T CT AT T T T T AT CT GAG T TACAGAAATGAT TAC T AAGGAAGAT T 
ACT CAGTAATTTGTT TAAAAAGTAATAAAAT TCAACAAACAT TTGCTGAATAGCTACTATATGTC 
AAG TGC TGTGCAAGGTATTACAC T C T GTAAT TGAATAT TAT T CC TCAAAAAATT GCACATAG TAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT 
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC 
AT T AACAAAT G TAT C AC TAGCCCT CCT TT T T CCAACAAGAAGGG AC T GAGAGAT GC AGAAATAT T 
TGTGACAAAAAATTAAAGCATTTAGAAAACTT 
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FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 

LSLAGKDQVETALKAS FETCSYGWVGDGFWI SRI S PNPKCGKNGVGVL IWKVPVSRQFAAYCYN 
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKWKEEKAND^ 

Signal sequence: 

amino acids 1-16 

Transmembrane domain: 

amino acids 235-254 

N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 



Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC 
L& T GGACAAAT GGTC T T T TCAGAGACAGT TATCACAT CT GTGGGAGACGAAGAAGGCAGAAGGAGCC 

Q ACGAGTGCAT CAT CGAC GAGGAC TGTGGGCCCAGCATGT AC TGCCAGTT TGCCAGC TT CCAGTAC 

f{ ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
f J ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
JJf ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
CP CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
N GAGATCC TGC TGCCCAGAGAGGT CC CCGATGAGTAT GAAGT TGGCAGCT TCAT GGAGGAGGT GCG 

:= CCAGGAGCTGGAGGACC TGGAGAGGAGCC TGACTGAAGAGATGGC GC TGGGGGAGCCT GCGGCT G 

Q CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA 
|if TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
!*■ CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
;Jf GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
O TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
ft! TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCT AGTGAAGAC CAGAGCAGT T TCATCTGGT TGT GACTC TAAGC T CAGT GCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
C T ACT GT TAGGAACAGCAGTGT TCT CACAGT GT GGGGCAGCCGTCCT TCTAAT GAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGT TAAC CT GCAGAAACAGTACT TAGGT AAT T GT AGGGCGAGGAT TATAAATGAAAT TTGC 
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 8 



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPWPGPALSYPQEEATLNEMFREVEELMEDTQHKL 
RSAVEEMEAEEAAAKAS SE VNLANLPP S YHNETNTDTKVGNNT IHVHRE I HKITNNQTGQMVFSE 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RS LTEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N -myr i s t oyl a t i on s i te . 

amino acids 202-208, 217-223 

Ami da t ion site . 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 

GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 

GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 

AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 

CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 

ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 

GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT 

CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 

TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 

ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 

GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 

ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 

GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 

GGAAGACCTGGAGCGGAACAACGGCT CCCTGGACCGGC CC TACT ACAT GT C CAAGAGCC T TC TAA 

AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG 

CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 

CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 

TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 

AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 

GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 

GATCGCGCCACTGCAC T CCAACCTGGGT GACAGAC T CTGT CTCCAAAACAAAACAAACAAACAAA 

AAGAT T T TAT T AAAGAT AT T T T GT T AAC T C 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFASFYWAFHKPQDI PTFPL I SAFIRTLRYHTGSLAFGALILTLVQIARVI L 
E YI DHKLRGVQNPVARC IMCCFKCCLWCLEKFIKFLNRNAY IMI AI YGKNFCVSAKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

| Important features: 
Q Transmembrane domains: 

O amino acids 57-80 (type II), 110-126, 215-231, 254-274 

i?l N-glycosylation sites. 

jft amino acids 16-20, 27-31, 289-293 

^ Hypothetical YBR002c family proteins. 

y amino acids 276-288 

Iff! Ammonium transporters proteins. 

O amino acids 204-231 

111 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 
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FIGURE ll 



O 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
CR GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
PI TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
J7 CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
I TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
PI GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
|fl TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
Q AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
J* ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG 
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGGAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETWAGPEGYETQWWDAPSI 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVIASLHVMMTLTNWYKPGETRKMISTWTAVWKICASWAGLLLYLWTLVAPLLLRNRD 

FS 

Signal sequence: 

amino acids 1-20 

Transmembrane domains: 

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 
272-283, 324-340, 391-406, 428-444 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
C T CAT GGC CAGGAT T GAGT CCTATGAAGGAAGGGAAAAGAAAGGCATAT C TGAT GT CAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GT CAGT T T TAT TCCCCTCCT GAAT CCGAAGCAGGATCTGAAGAAGCT GAAGAAAAACAGGAC AG T 
G AGAAAC C AC T T T T AGAAC T ATGAG T AC T AC T T T T G T T AAAT G T G AAAAAC C C T C AC AG AAAG T C 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTT TATTGAACAGCTAATAAAGATTTATT TAT TGTAATACCTCACAAACGT TGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC 
AAAAG AC T T AAT AT AT T GAAG T AAC AC T T T T T TAG T AAGCAAGAT AC C T T T T TAT T T C AAT T C AC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGT T T TT TAAC T CATGCACAT GT GC T CT T T GTACAG T T T TAAAAAGTGTAATAAAATC T G 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CT AAAT T AAAAA 



FIGURE 14 

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWI IELNVNGGIENTLEKEVMQYDYYSSYFDI FLIAVFRFKVLILAYAVCRLRHWWAIALTT 
AVTSAFLLAKVILSKLFSQGAFGWLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

amino acids 1-20 

Transmembrane domains: 

amino acids 54-72, 100-118, 130-144, 146-166 
N-myristoylation sites. 

amino acids 14-20, 78-84, 79-85, 202-208, 217-223 
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FIGURE is 

ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCC TGCCCCGCCCGGCTCCCT GCGCCGCCGCCGCC TCCCGGGACAGAAGATGT GCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC 
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 
^ CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 
P GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 
f!l TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 
m CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 
Z l TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 
Y£ TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 
Ml TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 
£p CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
l-A: C CACCAC C AC CACAGC CAC AGT G C CC AC CAC GAG GCCC G T GG T GC G GGAGC C CAC AGC C T T G T C T 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 
*.„ CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA 
V«l ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
If! GGCCT G TAC T GT GAGAGCCAGAT GGGGCAGGGGACACGGCCCAGCCCTACACCAGT CACGCCGAG 

p GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 
I** AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 
i™ GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 
U GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 
fll GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 
CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
AC AC CAC GTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGAC CACAGC T 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGC T GTGTGAC T C TAGT C T TGGCCCCAGGAAGCGAAGGAACAAAAGAAAC T GGAAAGGAAGATGC 
T T TAGGAACAT GT T T TGC T T T TT T AAAATATATAT ATT TATAAGAGATCC TT T C CCAT T TAT T C T 
GGGAAGATGT T T T T CAAAC TCAGAGACAAGGACT TTGGTTTTT GT AAGACAAACGAT GATATGAA 
GGCC TT T TG TAAGAAAAAATAAAAGATGAAGT GT GAAA 
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FIGURE 16 



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAARNPFNCVCPLSWFGPWVRESHVTLASPEE 
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWREPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 

CEVPLMGFPGPGLQSPLHAKPYI 

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature. 

amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA 
AAGGACCATACTACT GCAGGCAGAGTAGTTGCT GGT CAAATAT T TCTTGAT TCAGAAGAATC TGA 
ATTAGAATCCT C TATTCAAGAAGAGGAAGACAGCCT CAAGAGCCAAGAGGGGGAAAGT GTCACAG 
AAGAT ATCAGCT TTCTAGAGTCTCCAAA.TCCAGAAAACAAGGACTAT GAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
?Z TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG 
O CT ACAACC TAT G ACT ACAAAGCAGAT GAAAAGT GGGGC T T T T GT GAAAC T GAAG AAGAGGCT GC T 

AAGAGACGGCAGAT GCAGGAAGCAGAAATGAT GTAT CAAAC TGGAATGAAAAT CCT TAATGGAAG 
W CAATAAGAAAAGCCAAAAAA.GAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA 
% CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTT TATT TGGTGAT T ACT TGCCACAGAATATCCAG 

Tl GCAGCGAGAGAGATGT T TGAGAAGC TGACTGAGGAAGGC TC T CCCAAGGGACAGACTGC TCT T GG 

a CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
O T TGGAGCTCTTGGGGGCAATCTAATAGCCCACAT GGTTT TGGTAAGTAGACT TTAGTGGAAGGCT 

AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCG^^^ 

K CAGAT T TGC T TGTAT TAAGAC CAAATATTCAGT T GAAC T TCC T TCAAAT TCT T GT T AAT GGAT AT 

III 

H AACACAT GGAAT CTACATGTAAATGAAAGT T GGT GGAGTCCACAAT TT T TCT T TAAAATGAT TAG 

f|| TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA 
AAAT TT TGTAAATGGTGGCTAT AGAAAAACAACAT GAAATAT TAT ACAATAT T T T GCAACAATGC 
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGAT TAACT CAT T TT TAATAAAAT T AT GTCTAAGATTAAAAAAAA&APAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 18 



MRWIGLTLLLC^VLLSIASASSDEEGSQDESLDSKTTLTSDESWDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWC^TTYDYKADEKWGFCETEEEAAK^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S SQAKALVYYT FGALGGNLI AHMVLVSRL 



IP 



Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 

GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCT TTT GGGTGCTAGG 

CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 

ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 

AAA?^AGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 

AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 

CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 

CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 

CCT GT T TGGACT CAT CAGT G TGACAC TAAATAT GCTTCCTT TGGT CAAGAAAGCT CAAGGGAGAG 

TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

T AT GCAG T GGAAGGT T T CAATGACAGC T TAAGACGGGACAT GAAAGCT T T T GGT GTGC ACGT C T C 

ATGCATTGAACCAGGATTGT TCAAAACAAA.C T TGGCAGATCCAG TAAAGGTAAT T GAAAAAAAAC 

TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA 

AGTCTAGACAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG 

CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA 

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 

AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 

CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 

GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 

TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 

AT CAT C T C T TAT C TAAATAT T AAAAGAT AAG T CAACCCAAAAAAAAAAA?\A?\AAA7\AAAAAAAAk 

AAAAAAAAAAAAA 



FIGURE 20 



ML FWVLGLL I LCG FLWTRKGKLKIEDI TDKY I FI TGCDSGFGNLAART FDKKGFHVI AACLTES G 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY 
RE PIEVNL FGLI SVTLNMLPLVKKAQGRVINVS SVGGRLAI VGGGYTPSKYAVEGFNDS LRRDMK 
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD 
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAEl^PKAV 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCGAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA 
CAT TCAGAAA.TAT AT TCCATGCTATCAGCT TTT TAGCT TT TATAATT CTTCAGGCGAAGTAAAT G 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
j,^ T T CAAACCAAGACCTT GT T T TT CT GCTAT TAACACCAAGTATAATAACAGAAAGCT GCT CTAC TC 

O AT CGAC T GGAAC AT T CC T TATATAAACC T CAAAAAGGAC TTT TT C ACAGGGT ACC T TTAG T GGT T 

Cl GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
*f m I T T TT AGCCGAGCAGTACAAACACACAGC TC TAAAT TTT T TGAAGAAGAT GGATCCT T AAAGGAGG 

|?| TAC AT AAGATAAAT GAAAT GT AT GC TT CAT TACAAGAGGAAT T AAAGAGT AT AT GCAA&AAAGT G 

Ifl GAAGACAGT GAACAAGCAGT AGAT AAAC TAG TAAAGGAT GTAAACAGAT TAAAACGAGAAATTGA 

H GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 

?:f AT GT CT T TAAAAAAT AGACATGT T T CTAAAAGT AGC T GTAACT ACAACCACCATCT CGAT GTAGT 

y'i 

f4 AGACAATCTGACCT TAATGGTAGAACACAC TGACATTCC TGAAGCTAGTCCAGCTAGTACACCAC 

f|| AAAT CAT TAAGCAT AAAGCC T TAGACT T AGAT GAC AGAT GGCAAT T CAAGAGAT CTCGGTTGT TA 

Q GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAA?\AT 
GAGCAGCCCAGAAACAGATGAAGAAAT TGAAAAGATGAAGGGTT TT GGTGAATATTCACGGTCTC 
C TAC AT T T TGA T CCT T T T AAC C T T ACAAGG AG AT T T T T T TAT T T G G C T GAT GGG T AAAGC CAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA 

GAT GC T T T TAT T T CCAAACC T T T T T TT CACC TTT CAC T AAGT T G T T GAGGGGAAGGCT T AC ACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
T AGGGAAGACAAGT CAGGAGGATT GAT TGAAGCTAGGAGT TAGAGACCAGCCT GGGCAACGTAT T 
GAG AC CAT G T C TAT TAAAAAAT AAAAT GG AAAAG C AAG AAT AGC C T TAT TTT C AAAAT AT GGAAA 
GAAAT T TAT AT G AAAAT T T AT C T GAG T CAT T AAAAT T C T C C T T AAG T GAT AC T T T T T T AGAAG T A 
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT 

AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFS FYNSSGEVNEQALKKI LSNVKKNWGWYKFRRHSDQIMT FRERLLHKNLQEHFSNQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHS SKFFEEDGSLKEVHK INEMYAS LQEELKS I CKKVEDSEQAVDKLVKDVNRLKRE I EKRRGA 
Q I QAAREKN I QKDPQEN I FLCQALRT FFPNS E FLHS CVMS LKNRHVSKS SCNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 

DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCT GCT GGCT GCC GAGCTCAAGT CGAAAAGT T GCT C GGAAGTG CGACGT CT TTACGT GT CCAAAGG CT T C 
M AACAAGAAC GAT GC CCCCCTCCACGAGAT CAAC GGT GAT CATT T GAAGAT CT GT CCC CAGGGTT CTAC CTGCT G CT CT 

C3. CAAGAGAT GGAGGAGAAGTACAGCCTGCAAAGT AAAGAT GAT TTCAAAAGT GT GGT CAGCGAACAGT GCAAT CATT T G 

IJ CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTT CAAAGAACTACTTGAAAATGCAGAGAAATCCCT G 

iM AAT GAT AT GTTT GT GAAGACATAT GGCC ATTT ATACAT GCAAAATT CT GAGCTATT TAAAGAT CTCTT CGT AGAGT T G 

W AAACGTTACTACGTGGTGGGAAATGTGAACCTGGAAGAAATGCTAAATGACTTCTGGGCTCGCCTCCTGGAGCGGATG 
^ S T TCC GCCT GGT GAACT CCCAGTAC CACTT TACAGAT GAG TAT CT GGAAT GT GT GAGCAAGTATAC GGAG CAGCT GAAG 

m 

Y' a CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
\ ATGATCTACTGCT C CCACT GC CGGGGT CTC GT GACT GT GAAGCCAT GTT ACAACTACTGCT CAAACAT CAT GAGAGGC 

Jff T GTTTGGCCAACCAAGGGGAT CT CGAT TTTGAATGGAACAAT TT CATAGAT GCT AT GCT GAT GGT G GCAGAGAGGCTA 

'fj GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 

^ AATAGTGT T CAAGT GT CTCAGAAGGT TTT CCAGGGAT GT GGACC CCC CAAGC CC CT CCCAGCTGGACGAAT T T CT CGT 

if! |: 

^5 TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 

^ AGTT TGGACCGACT GGT TACT GAT GT CAAGGAGAAACTGAAACAGGC CAAGAAAT T CTGGT CCT C C CTT CCGAGC AAC 



GTTT GCAACGAT GAGAG GAT GGCT GCAGGAAACGGCAAT GAG GAT GACT GTT GGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTTT GCAGT GACAG GAAAT GGATTAGCCAACCAG GGCAACAACC CAGAGGTC CAGGTT GACACCAGCAAACCAGAC 
ATACT GAT C CTT CGTCAAAT CATGGCT CT T CGAGT GATGAC CAGCAAGAT GAAGAAT GCATACAAT G GGAAC GACGT G 
GACTT CTTT GATAT CAGTGAT GAAAGTAGT GGAGAAGGAAGT GGAAGTG GCT GT GAGTAT CAGCAGT GCCCTT CAGAG 
T TT GACT ACAATGC CACTGAC CAT GCTGGGAAGAGT GCCAAT GAGAAAG CC GACAGT GCT GGTGT C CGT CCT GGG GCA 
CAGGCCTACCT C CT CACTGT CT TCT GCAT CTT GT T CCTG GT TAT GCAGAGAGAGT GGAGATAATT CT CAAACTCT GAG 
AAAAAGT GTTCATCAAAAAGTTAAAAGG CACCAGT TAT CACTTTT CTAC CAT CCTAGT GACT TT GCTTT T TAAAT GAA 
T G GACAACAAT GTACAGTTTTT ACT AT GT GGCCACT GGTT T AAGAAGT GCT GACT T T GTT TT CT CATTCAGTTT T GGG 
AGGAAAAGGGACTGT GCATT GAGTT GGTT C CTGCT CCCCCAAAC CAT GT TAAAC GT GGCT AACAGT GTAGGT ACAGAA 
CTATAGTTAGT T GT GCATTT GT GATTTTAT CACT CTATTAT TTGTT T GT AT GTTT TTTT CT CATTT CGT TTGT GGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAA.GCAAACCAGGGTCCCTTCTTGGCACGTAACATGTACGTATT 
TCT GAAAT AT TAAAT AGCT GTACAGAAGCAGGTTTTATTT AT CAT GTTAT CT TAT T AAAAGAAAAAGC CCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTIAVLSAALLAAELKSK^ 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENMKSLNDMFVKTYGH 
LYMQNSELFKDLFVELKRYYWGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 
VTWPCYNYCSNimGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDWISDAI^ 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG 
KSANEKADSAGVRPGAQAYLLTVFCI LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 

CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 

CT GAGTCAT CCCC AGGGATCAGGAGCC TCCAGCAGGGAACC T T CCAT TATAT TCT TCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTGTAGGAGCTCTG 
AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACCTTAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATTTTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
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FIGURE 26 



MKVLISSLLLLLPIJ4LMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



Important features: 
Signal peptide: 

amino acids 1-22 

u 

f»h N-myristoylation sites. 

O amino acids 27-33, 46-52 

m 
w 
w 
m 



C3 
If! 

0 

ill 
Q 

rii 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 

AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 

CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 

T GGCCGTGCTGACTCCCAAGAGCAACCGCAAGAT GGAGAGCAAGAAACGGGAGCTCTTCAGCCAG ' 

ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 

CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 

AGTTCCTGACAAAGAGCATGGCT GAGAACAGGCACCTCCAATAT GAGCGGT T TGTGGTGGCTCCT 

GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA 

GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACGTGTTG 

GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

TT CCCCAGCC TCCAAT TAGAACAAGCCACCCACCAGCCTATCTAT CTTCCACTGAGAGGGACCTA 

GCAGAATGAGAGAAGACAT TCAT GTAC CACC T AC TAGT CCCTCT CT CCCCAAC CT C T GCCAGGGC 

AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 

AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 

CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 

CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 

CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 

ATGCCAGAGCAAGACTCAAAGAGGCAGAGGTTTTGTTCTCAAATATTTTTTAATAAATAGACGAA 

ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM 
RQIJUDGSMDVWCTLVLCSVQSPRKVLQ^ 

KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 

Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern, 

amino acids 10-32 

N-myristoylation sites. 

amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 



CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTASAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGTT TATGAT TC T TATAC TAAT T TATACAAAGATAT TAAGGC CC T GT TCAT TAAGAAATT 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE SO 

MLLLT LLLLLLLLKGS CLE WGLVGAQKVS SAT DAP I RDWAFFP P S FLCLL PHRP AMT C S QAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 

jp Growth factor and cytokines receptors family: 

amino acids 3-18 

Cfl 
III 

m 
m 

u 

IS 

ill 
If! 
O 
fy 
O 
III 
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FIGURE 31 

GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTT GAT TTTTGTTCT C AGAAAGAGAAT AAAAT TGAC AGTT GAGC T T T T CCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC C AG AAT GC AT AT AC T AC AAC T GC TAT T AAT G G GAC AGAT T T C T G T ACAT C AG C AAAAGAT GC 
AT T CAAAAT CT T GTCCAAGAACT CAAG T CAC TT TACATCTAT TAAC TGC T T TGGAGAGT T CATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGATCG TCAGAAAAGCCC TACT T TATGGATCAAGAATT TCT GAGT T TCGTA 
AAAAGGAGCAACAAAT TAAACAAT GCAAGGGCACAGCAGGACAAGCACTCAT TAAGG AATGAG GA 
GGGAACAGAACT CCAGGCCAT T GT GAGATAGATACCCAT TTAGGT ATCT GTACCT GGAAAACAT T 
T C C TTC TAAGAGCCAT T TACAGAATAGAAGAT GAGACCACTAGAGAAAAGT TAGT GAAT T T T T T T 
T TAAAAGAC C T AATAAACCC TAT T CT T CC T CAAAA 
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FIGURE 32 

MSGRDTILGLCILALALSLA^FTFRFITTLLW 

LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHP ILSSLS I LFFYHQGTWKGS FLI S WRI PRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDALFLCFA 
VDLE TNDGS SEKPY FMDQE FLS FVKRSNKLNNARAQQDKHSLRNEEGTELQAI VR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites . 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 



APP_ID=10063561 



Page 182 



FIGURE 33 

GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAAT T T GTAT GGGAT T T GT GAT GCAGGAAAGC CT AAGGGAAAAAGAATAT T CAT T CT G T GT GGT 
G AAAAT T T T T T G AAAAAAAAAT TGCCTTCTT C AAAC AAG G G T G T CAT T C T GAT AT T T ATGAG G AC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAACAAAGAAACGGCAAAGAAGAT TAAAAGGCCCAAG T TCAC T GTGCCTCAGAT CAAC TGC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
3 : TATCG T CT TAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGC TCT TACATACT CAT CAT 

12 CGAAAAGTCCAGCTGCCCAAGCAGGT GAGACCACAAAAGCCTATCAGAGGCCACCTAT TCCAGGG 

D ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
Q CACC T TGCCAAGGCCAT CCC CT TCTGC TGCT TC T ACCACCAGCAT CCCCAGACCACAAT CAGTGG 

fl| GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
l\l AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
?2; GGAT GTCAGCCT GGGAC T TGTT C CAAAAGAAGAAT T GAGCACACAGT C T T TGGAGCCAGT ATCCC 

tfi TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA 
01 CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
CGGT CCACT GAT GGGT GTT GTCCAGT AT GGAGACAACCC TGC T AC T CAC TT TAAC CTCAAGACAC 
* ACACGAAT T CT CGAGATCT GAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGACT T TCTAAT 

GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 

0 GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 

01 GACT TGCGAGAGAGT CAGGAAT CAACAT T T TC T T CAT CACCAT T GAAGGTGC TGC T GAAAAT GAG 
fl AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
JJ;j GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
*!f ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
l-l GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
fll AGAGT T TGAGAT T T CCGACACGGACACGCGCAT CGGGGCCGT GCAG TACACCTAC GAACAGCGGC 

T GGAGT TT GGGT TCGACAAGTACAGCAGCAAGCCT GACAT CC TCAACGCCAT CAAGAGGG TGGGC 

TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 

GT CCAAGCCCAACAAGAGGAAGT TAAT GATCC T CATCACCGACGGGAGGTCC T ACGACGACG T C C 

GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 

GCCCAAGAGGAGC TAGAAGTCAT T GCCACT CACCCCGCCAGAGACCACTCC T T CT T T GT GGACGA 

G T T TGACAACCT CCATCAGTATGT CCCCAGGAT CATCCAGAACAT T T GTACAGAGT T CAAC TCAC 

AGCCTCGGAACTGAATTCAGAGCAGGCAGAGCACCAGCAAGTGCTGCTTTACTAACTGACGTGTT 

GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 

AAATGTCTTGTTATTATTCTTTGCCATCATGCTTTTTCATATTCCAAAACTTGGAGTTACAAAGA 

T GATCACAAACGTATAGAATGAGCCAAAAGGCTACATCATGTTGAGGGTGCTGGAGATT T TACAT 

T T T GACAAT T GT T T T CAAAATAAATGT TCGGAAT ACAGTGCAGCC C T TACGACAGGC T T AC G TAG 

AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 

T T T TGT CAT GACAATGTAGGAAT T GCTGAAT T AAAT GT T TAGAAGGAT GAAAAATAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTVVLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RSGAPNVWVMVDGWPTDKVEEASRL^ 

FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMI LI TDGRS YDDVRI PAMAAHLKGVI T YAI GVAWAAQEELEVI ATHPARDHS FF 
VDEFDNLHQYVPRI IQNICTEFNSQPRN 

Important features: 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-200 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACC ATGT CT TTTT TT CTGT TT TC AG AGTAGT TC AC AACAGATCTG AGTGTT T T AATT AAGC AT GGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCT TCCC C ACT AC AATGTG AT AG AACGCGTGAACT GGAT GT ACTT CT AT GAGT AT GAGCCG AT TT A 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC 
T GGT GACCTCC CACCC TTC AGATGTGAAAGCC AG GCAGGCC ATT AGAGT T ACTT GGGGTG AAAAAAAGTCT 
TGGT GGGGAT AT GAGGT T CT T ACAT TTT TCT T AT TAGGCC AAGAGGCT GAAAAGGAAG AC AAAAT G TT GGC 
AT TGT CCTTAG AGGATGAAC ACCT TCTTTATGGT GAC ATAAT CCGACAAGAT TT TTT AGACACAT AT AAT A 
ACCT GACC TT GAAAACC ATT AT GGCATTCAGGTGGGT AACTGAGT T TT GCCCCAATGCCAAGT ACGT AAT G 
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA 

GAAGTTT T TC ACAGGT TATC CT CT AATT GAT AATTAT TCCTATAGAGG ATTT TACC AAAAAACC CAT AT T T 
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 
TT GGT GCC AAGG AT CT AT GAAATG AT GGGTC ACGTAAAACCC ATCAAGT TT GAAGATGTT T ATGTC GGGAT 
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC 
AT TT GGAT GT CTGT CAACT G AGACGT GT GATT GCAGCC C ATGGCT T TT CTTCCAAGGAGAT CAT CACTTTT 
T GGC AGGT CAT GCT AAGG AACACC AC AT GCCATT AT TAACTT CACAT T CT AC AAAAAG CCT AGAAGGAC AG 
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
AC ACT GAACT G AAACT CAT GAAAAACCC AG AC TGGAGACT GG AGGGTT AC ACTT GT GATT T ATT AGTCAGG 
CC CT TC AAAGAT GAT AT GTGGAGG AATT AAAT AT AAAGGAAT T G GAGGTTTT TGCT AAAG AAAT T AAT AGG 
ACCAAACAATTTGGACATGTCATTCTGTAGACTAGAATTTCTTAAAAGGGTGTTACTGAGTTATAAGCTCA 
CTAGGCTGTAAAAACAAAACAATGTAGAGTTTTATTTATTGAACAATGTAGTCACTTGAAGGTTTTGTGTA 
TATCTTATGTGGATTACCAATTTAAAAATATATGTAGTTCTGTGTCAAAAAACTTCTTCACTGAAGTTATA 
CT GAAC AAAAT TTT ACCT GT TT TTGGTC AT TT AT AAAGT ACT TC AAGATGTT GC AGT ATT T C AC AGTT AT T 
ATTATTTAAAAT T ACT TCAACT TTGTGTTTTTAAATGT TTTGACGATTTCAATACAAGATAAAAAGGATAG 
TGAAT CAT TCT T T AC ATGC AAACATT TT CC AGTT ACT T AACTGATCAGT T TATT AT TGAT AC AT C ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 



FIGURE 36 

MASALWTVLPSRMSLRSLKWSLLLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features : 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins. 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



CGCTCGGGCACCAGCCGCGGCAAGGMGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGT CC T TGCC AAGAGAGTAC ACAGTCATTAATGAAGCC TGCCC TGGAGCAGAGTGGAATATCATGTGT CGGGAGTGCT GT G 
AATATGATC AGAT TGAGTGC GTC TGCC CC GGAAAGAGGGAAGTCGTGGGT TAT AC C AT CC C TT GC TGCAGGAATGAGGAGAA 
T GAGTGTGACTCC TGCCTGAT CCAC CCAGGTT GTACCATC TTT GAAAAC TGCAAGAGC TGCCGAAATGGC TCATGGGGGGG T 
ACCT TGGAT GAC TTC TATGTGAAGGGGT T C TACTGTGCAGAGT GCCGAGCAGGCT GGTAC GGAGGAGAC TGC ATGCGATGTG 
GCCAGGT TC TGCGAGC CCCAAAGGGTC AGATT T TGT TGGAAAGC TATC CCC TAAATGC TC ACTGT GAATGGACCAT TCATGC 
TAAACCTGGGT T TGT C ATCCAAC TAAGAT T TGTCATGTTGAGTC TGGAGTT TGAC TACATGTGCCAGTATGAC TATGTT GAG 
GTTC GTGAT GGAGACAACCGCGATGGC C AGATCATCAAGCGTGTCTGTGGC AACGAGCGGCCAGC TCC TAT C CAGAGCAT AG 
GATC C TCACTCCACGT CC TC T TC C ACT C CGATGGC TCCAAGAATTT TGACGGT TTCCAT GCCAT T TATGAGGAGATCACAGC 
ATGC TCC T CAT CCC C T TGTT TCCAT GAC GGCACGTGCGT CC TT GACAAGGC TGGATC TTACAAGTGTGC CTGCTTGGCAGGC 
TATACTGGGCAGC GC T GTGAAAATC TC C T TGAAGAAAGAAACT GCTCAGACCC TGGGGGC CCAGT CAAT GGGTACCAGAAAA 
TAACAGGGGGC CC TGGGC TTATC AACGGACGCCATGC TAAAAT T GGCAC CGTGGTGTCT T TC T T TTGTAACAAC TCC TATGT 
TCT TAGTGGCAAT GAGAAAAGAAC T TGCCAGCAGAATGGAGAGT GGTC AGGGAAACAGCC CAT C TGCATAAAAGCC TGC CGA 
GAAC C AAAGAT TT CAGACCTGGT GAGAAGGAGAGT TC TTCC GAT GCAGGTTCAGTCAAGGGAGAC AC CAT TAC ACC AGC TAT 
ACTC AGCGGCC TT CAGCAAGCAGAAAC TGC AGAGTGCCCC TAC CAAGAAGC CAGCCC TTCCCT T T GGAGATC TGCCCAT GGG 
ATACC AA.CATC TGCATACCCAGC T C CAGTATGAGTGC AT C TCACCCTTC TACCGC CGCC TGGGCAGCAGCAGGAGGACAT GT 
C TGAGGAC T GGGAAGTGGAGTGGGCGGGC ACC ATCC TGCAT CCC TATC TGCGGGAAAAT T GAGAACATCAC TGCTCCAAAGA 
CCCAAGGGT TGCGCTGGCCGTGGC AGGCAGCCATC TACAGGAGGACC AGCGGGGTGCAT GACGGCAGCC TAC ACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCAT GATC AAGAC AGCAGAC C TGAAAGT TGT T T TGGGGAAATTC TACCGGGAT GAT GACC GGGAT GAGAAGAC CATCC 
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT 
C CTAGACAAGGCCCGT ATCAGCAC C CGAGTCCAGCCCATC TGC C TCGC TGCC AGT C GGGATCTCAGCAC TTCC TTC CAGGAG 
TCC CACAT CAC TGTGGC TGGCTGGAAT GT CC TGGCAGAC GTGAGGAGCCC TGGCT TCAAGAACGACACACTGCGCTC TGGGG 
T GGT CAGTGTGGTGGAC TCGCTGC T GT GT GAGGAGC AGC ATGAGGACCATGGCATCCCAGTGAGTGTCAC TGATAACATGT T 
CTGTGCCAGCTGGGAACCCACTGCCCCTTCTGATATCTGCACTGCAGAGACAGGAGGCATCGCGGCTGTGTCCTTCCCGGGA 
CGAGC ATC TCC TGAGC CACGCTGGC AT CTGATGGGAC TGGTCAGC TGGAGC TATGATAAAACAT GCAGCCAC AGGC TCTC C A 
C TGCCTTC ACCAAGGT GC TGCCT T T TAAAGAC TGGAT TGAAAGAAATATGAAAT^ACCATGC TCATGC AC TC C TTGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGC T TC TGAC TTC AGGGACAAAAC TCAGTGAAGGGT GAGT AGACC TCC AT TGC TGGTAGGCT GATGCCGCGT CCA 
C TAC TAGGACAGC CAAT TGGAAGAT GCCAGGGC T TGCAAGAAG TAAGT TTC T TC AAAGAAGACCATATACAAAACC TCTCC A 
C TC C ACTGACC T GGTGGTC T TCC CCAAC T TTCAGT TATAC GAAT GCCAT CAGC T TGAC CAGGGAAGAT C TGGGCT TCAT GAG 
GCCCCTTTTGAGGCTCTCAAGTTC TAGAGAGC TGCC TGT GGGAC AGCC CAGGGCAGC AGAGCTGGGAT G TGGTGCATGCC T T 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAAC T AC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP I QS I GS S LHVL FHS DGS KNFDGFHAI YEE I TACS S S PC FHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTWVAAHCVTDLGKVTMIKTADL 
KWLGKFYRDDDRDEKT I QS LQI S AI I LHPNYDPI LLDADI AI LKLLDKAR I S TRVQP I CLAASR 
DLS TS FQESH I TVAGWNVLADVRS PGFKNDTLRS GWSWDS LLCEEQHEDHGI PVS VTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature . 
amino acids 260-272 
N-glycosylation sites. 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-rnyristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGA ATGA AGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCT C CCAC TACGGC CTGAC CAAAGAT AGGAAGAG GCGCT CACAAGAT GGCT GT CCAGACGG C T GTGC GAGCC 
TCACAGCCACGGCTCCCTCCCCAGAGGTTTCTGCAGCTGCCACCATCTCCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCT GC CTACGT GT CC T C G GCAGAGGACGGGCAG CCAGCAAT CAGC CCAGT GGACT CT G GCCGGAGCAACC GAACTA 
GGGCACGGC CCTTT GAGAGAT CCACTATT AGAAGCAGATCAT TTAAAAAAAT AAATC GAGCTTT GAGT GTTCT T C GAA 
H ! GGACAAAGAG CGGGAGT GCAGTT GC CAACCAT GCC GACCAGGGCAGGGAAAAT T CT GAAAACACCACT GC CCCT GAAG 

1*1 TCTT T CCAAGGTT GTAC CAC CT GATTC CAGAT GGT GAAATTACCAGCAT CAAGATCAAT CGAGTAGAT CCCAGT GAAA 

GCCTCTCTATTAGGCTGGT GGGAGGTAGCGAAACCCCACTGGTCCATAT CATTATCCAACACATT TAT CGTGAT GGGG 

ff z s 

V T GAT CGC CAGAGAC GGC C GG CTACT GC CAGGAGACATCATTC TAAAGGT CAACGGGAT GGACAT CAGCAATGT CC CT C 

IIP ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
%H GCAGGAACIAATGGACAGGCCCCGGATGCCTACAGACCCCGAGATGACAGCTTTCATGTGATTCTCAACAAAAGTAGCC 
I? 3 CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GT GGCAT AT CGACAT GGT CAGCTT GAGGAGAATGACCGTGTGTTAGC CAT CAATGGACAT GAT CTT CGATAT GGCA 
S GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGACATCTTTCAGGAAGCCGGCTGGAACAGCAATGGCAGCTGGTCCCCAGGGCCAGGGGAGAGGAGCAACA 
CT CCCAAGCCCCT CCAT C CTACAAT TACT T GT CAT GAGAAGGTGGTAAAT AT CGAAAAAGACC C CGGT GAAT CT CTC G 
W GCAT GAC CGT C GCAGGGGGAGCATCACATAGAGAAT GGGAT TT GCCTAT CTAT GT CAT CAGT GTT GAGCC CGGAGGAG 

"I T CATAAG CAGAGAT GGAAGAATAAAAACAGGT GACATTT T GTTGAAT GT GGAT GGGGT CGAACT GACAGAGGT CAGCC 

%J GGAGT GAGGCAGTGGCATTATT GAAAAGAACAT CAT C CT C GATAGT ACT CAAAGCTTT GGAAGT CAAAGAGTAT GAGC 

CCCAGGAAGACTGCAGC^GCCC^GCAGCCCTGGACTCCWVCCACAA(^TGGCCCCACCCAGTGACTGGTCCCCATCCT 
GGGTCAT GT GGC TGGAATTACCACGGT G C TTGTATAACT GTAAAGATATT GTATTAC GAAGAAACACAGCTG GAAGT C 
T GGGCTTCT GCATT GTAGGAGGTTAT GAAGAATACAAT GGAAACAAACCT TT TTT CAT CAAAT C CAT T GTTGAAGGAA 
CACCAGCATACAAT GAT G GAAGAATTAGATGTG GT GATAT TCTT CTT GCT GT CAATGGTAGAAGTACAT CAGGAATGA 
TACAT GCTT GCT TGGCAAGACT GCT GAAAGAACT T AAAG GAAGAATTACT CTAAC TAT T GTTT CT TGGCCTG GCACTT 
TT TT ATAGA AT CAAT GAT GGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTTGAAACACTATATT T AT C 
T T GT CAGTTT TTAT ATT TAAAGAAAGAATACATT GTAAAAAT GT CAGGAAAAGTATGAT CAT CTAAT GAAAGC CAGT T 



AACATT GTTTATATT TT TT CT AT T CAATAAAAAGC CCTAAAACAACTAAAAT GATTGATTTGT ATACCC CACT GAAT T 
CAAGCT GATT TAAATTTAAAATTT GGTAT AT GCTGAAGTCT GCCAAGGGTACATTAT GG CCAT TTTTAAT TT ACAGCT 
AAAATAT TTTT T AAAAT GCATT GCT GAGAAACGT T GCTT T CAT CAAACAAGAATAAATAT TT T T CAGAAGTT AAA 



13 
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FIGURE AO 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
AL S VLRRT KS G S AVANHADQGRENS ENT TAPE VFPRL YHL I PDGE I TS I KINRVDPSESLS IRLV 
GGSETPLVHI 1 1 QHI YRDGVIARDGRLLPGDI I LKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
RE QKFRS RNNGQAPDAYRPRDDS FHVI LNKS S PEEQLG I KLVRKVDE PGVFI FNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNSNGSWSPG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE 41 



ACCAGGCATTGTATCTTCAGTTGTCATCAAGTTCGCAATCAGATTGGAAAAGCTCAACTTGAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCAT TCAAGAGATTCC TAAAGCAAAGGAGTTCAT GGCTAAT TTCC 
ATAAGACCCTCAT TTTGGGGAAGG GAAAAACTCTGACTAAT GAAGCAT CCACGAAGAAGGTAGAACT T 
GACAACTGTGCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGGCCTCAGGAATGTAAAG 
Q CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
O CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG 
f I TA?\AAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG 

m 

: 5 i CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG 
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
, GGGGAGGCGAAGACGATGACCTCAGACTCAGGGTT GAGCT CCAAAGAAT GAAAAT T TCCCGGCCCCT G 

O C CT GAAGTGGGTAAAT ATACAAT GGTC T T CCACACTAGAGACAAAGGCAAT GAGGTGAACGCAGAAC G 

ill GATGAAGCTCT TACACCAAGT GTCACGAGTCT GGAGAACAGAT GGGTTGAGTAGTTGTTCTTATAAAT 

y TAGT ATCTGTGGAACACAAT CCTT TATATAT CAACATCACAGTGGATT TCTGGTTTGGTGCATGACCC 

jLJf TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
IS ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
TAG C AGAG CTCCTGGT GAT G T AGAGT AT AAAAC AG T T G T AACAAG AC AG C T T T CT TAGT CAT T T T GAT 
CATGAGGGTTAAATATT GTAATAT GGAT ACT TGAAGGACT T TAT ATAAAAGGATGACTCAAAGGATAA 
AAT GAACGCTATT TGAGGACTCTGGTT GAAGGAGATT TAT T TAAATT T GAAGTAATATAT TAT GGGAT 
AAAAGGC CACAGGAAAT AAGAC TG CT GAAT G T C T G AGAGAACCAG AG TTGT TCTCGT CCAAGG TAGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGTGAACTTGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TT TAAAGAGT TT TT GTAAAAT GATTT T GTACAAGTAGGATAT GAAT T AGCAGTT TACAAGTTTACAT 
ATTAACT AATAAT AAAT AT GT CT AT CAAATAC CT CT GT AG T AAAAT GT G AAAAAG C AAAA 
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FIGURE 42 

MGFNLTFHLSYKFRLLLLLTLCLTWGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RmEKHLMYLLEHLHPFLQRQQLDYGIWIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRKKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV 

SVEHNPLY INI T VD FW FGA 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8,. 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 

GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAfiAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 4.4 



MALS SQI WAACLLLLLLLAS LT SGS VFPQQTGQLAELQPQDRAGAFIAS WMPMFQRRRRRDTHFP I 
CIFCCGCCHRSKCGMCCKT 

Important features: 
Signal peptide: 

amino acids 1-24 

t 

%J cAMP- and cGMP-dependent protein kinase phosphorylation site. 

ill 

tf% amino acids 58-59 

Uj 

||1 N-myristoylation site. 

m 

amino acids 44-50 

u 



O 
III 

c:i 

fi! 

o 
rii 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA 
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGGAAACTGAAG 
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACG T GCT GCATGTC TACGAGCACCTGTCAAAGCC TAAAGT CACCAT GGGT CTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
C TGGAGATGGGGAGAAAGT GATATGACCT TCATC TGCGTT GCCAGGAACCC T GT CAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG 
AAACTCC TAACATATGCCCCCAT T CT GGAGAGAACACAGAGTACGACACAATCCCT CACACTAAT 
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCAC TCAC TGC TCACGATGCGAGACACACCAAGGC TATT TGCCTAT GAGAATG TTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 
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FIGURE d6 

MAGS PTCL TL I Y I LWQLTGSAAS GPVKELVGSVGGAVT FPLKSKVKQVDS I VWT FNT T PL VT I QP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVI YTWKALGQAANE SHNGS I L P I S WRWGE SDMT F I C 
"VARNPVSRNFSSPILARKLCEGA71DDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVDI CRETPNI CPHS GENTEYDT I PHTNRT I LKEDPANTVYSTVEIPKKMENPHSLLTMPDT 
PRLFAYENVI 

Important features: 
Signal peptide: 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 



GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 

CT GCAACAACAGAAC TGGAATGTT T CT TT CAT CAT T T T TCAGT GT GAT CACAGT CAT T GGTGC T C 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAAT T GT GAAT T T T CAT T GAAAAAC ATC AGTGACAT TCAT CCAGAAT CC T T CAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
AC AC CAT GGCGAGT GGCT GGAGAGCAT CTAGT T T CCACT TCGAT T C T GAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
T GTAGT T TAAT GGGAATAAAAT GT AAGTAT CAGTAGT T TGAAAAAAAAAAA 



FIGURE 48 



MTCCEGWTS CNG FSLLVLLLLGWLNAI PL I VSLVEEDQFSQNP I SCFEWWFPG 1 1 GAGLMAI PA 
T TMSLTARKRACCNNRTGMFLS S FFS VI TVI GALYCML I S I QALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 

LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids' 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site* 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE AQ 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGT T GCT GACCAAT TGAGCT GT GAGCCT GGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CAT T TGCCAATAAAGACGAT CCC T T C TAC TATGACT GGAAAAACCTGCAGC TGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
T T CT GATCAGGAGGCT TCT T TAT GAAT TAAACT CGCCCCACCACCCCC TCA 
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FIGURE so 



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
S QKQHS PVPEKAI PL I TPGSAT TC 



Important features: 
Signal peptide: 

amino acids 1-16 

u 
a 

0 • 
m 

M 

||| N-myristoylation sites. 

81 amino acids 41-47, 45-51, 84-90 

u 



Transmembrane domain: 

amino acids 36-59 



Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 

amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAG T CGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGC GGCAAGG 
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGGAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
*f GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
" AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
|| T T GGCAGACAGGCAGAAGAT GT CATT CGACACGGAGCAGAT GCT G TCCGCGGCT C CT GGCAGGGG 

||| GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
ill. CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 
J M GAAAC T CAGCAGGCAGC T T T GGAAT GAATCC TCAGGGAGC TCCCT GGGGTCAAGGAGGCAAT GGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 
f!J AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
111 ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
{3 AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 
J| CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATT TTAAATCCAAGC TGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGC TC TC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 52 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
M FN F DT FWKNFKS KLG F I NWDAINKDQR S S R I P 



Signal peptide: 

amino acids 1-21 



ff| N-glycosylation site. 

I s * amino acids 265-269 

Glycosaminoglycan attachment site. 

*t 3 amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

p| Casein kinase II phosphorylation site. 

fjj amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE sa 

GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGA2£TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAAC TGGT T T TGGGGT CACCT GGGCCT GAT CACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 

S ; 

fp*. TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
f t; 1= AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
ffl GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 

W CCTCAGAGGGCAGCAG TCGTC TGGACAT GT TT GAGCACATCAGCCT CATGACCT T GGACAGTC T A 

Iff 

Jl: CAGAAAT GCAT CT TCAGCTT TGACAGCCATTG T CAGGAGAGGCCCAGTGAATATATTGCCACCAT 

|7 CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
, ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
O GC TGTCATCC GGGAGCGGCG T CGCACCC T CCC CACTCAGGGTAT TGATGAT T TT T TCAAAGACAA 

y AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
*ft CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ffj ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
C CAT CC AC CTGTTTTTTT GCAGAT T GT CAT GAAT AAAACGG TGC T G T CAAA 
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FIGURE 

MSLLSLFWLGLRFVAMSPWLLLLLW 

LITPTEEGLKDSTQMSATYSQGFTVWLGPI IPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 

WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 

SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 

RLVHDFTDAVI RERRRTLPTQG I DDFFKDKAKSKTLDFI DVLLLSKDEDGKALS DEDI RAEADT F 

MFGGHDTTASGLSWLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 

LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTWPDPEVYDPFRFDPENSKGRSP 

LAFIPFSAGPRNCIGQAFAMAEMKWIJUjMLLHFRFLPDHTEPRRKLELIMRAEGGL 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 4 61-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

ATCGCATCAATTGGGAGTACCATCTTCCTCATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT TCATGGCCAGTTTTATGAAGCT TTGGAAGGCACTATGGACAGAAGCTGGT GGACAGT TTT 
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
G TGAT TCGAACAT TT GAGGGT TACT T T TGGAAGCAACAATACAT T C T CGAACCT GAAT G TCAG TA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE z6 

MGPVKQLKRMFE PTRL I AT IMVLLC FALT LCS AFWWHNKGLAL I FC I LQSLALTW YSLS F I P EAR 
DAVKKC FAVC LA 



Important features: 
Signal peptide: 

amino acids 1-33 

Q 

f% Type II fibronectin collagen-binding domain protein. 

01 amino acids 30-72 

w 
in 
m 



a 
wi 

C3 

to 

o 
ru 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAACATGCTCTGTCTGTGCCTG 
TAC GT GCCG GT CAT CGGGGAAGCC C AGAC CGAGT T CCAGTACTTT GAGT CGAAGGGGCT C CCT G C CGAGCT GAAGT CC 
ATTTT CAAGCT CAGT GTCTT CAT C CC CT CCCAGGAATT CT CCAC CTAC C GCCAGT GGAAG C AGAAAATT GT ACAAGCT 
GGAGATAAGGACCTT GAT G GGCAGCT AGACT TT GAAGAATT T GT CCATTAT CT C CAAGAT CAT GAGAAGAAGCT GAGG 
CTGGT GTTTAAGATT TT GGACAAAAAGAAT GAT GGACGCAT T GACGCGCAGGAGAT CAT GC AGT CC CT GCGGGACTTG 
GGAGT CAAGATAT CT GAACAGCAGGCAGAAAAAATT CT CAAGAGCATGGATAAAAACGG CACGAT GAC CAT CGACTGG 
AACGAGTGGAGAGACTACCACCTCCTCCACCCCGTGGAAAACATCCCCGAGATCATCCTCTACTGGAAGCATTCCACG 
ATCTTT GAT GT GGGT GAGAAT CTAACGGT C CCGGAT GAGT T CACAGTGGAGGAGAGGCAGACGGGGAT GTG GT GGAGA 
CACCT GGT GGCAGGAGGT GGGGCAGGGGCC GTAT C CAGAACCT GC ACGGCC CCC C T GGACAGGCT CAAGGT GCT CAT G 
CAGGT CCAT GC CT CC C GCAGCAACAACAT GG GCAT CGTT GGT GGCT TCACT CAGAT GATTC GAGAAGGAGGGGCCAGG 
. j T CACT CT GGCGGGGCAAT GGC AT CAACGT CCTCAAAATT GCCCC C GAAT CAGCCAT CAAATT CAT GGC CTATGAGCAG 

f* h ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
Q ATCGCCCAGAGCAGCAT CTAC CCAAT GGAG GTC CT GAAGACCCGGAT GG C GCT GCGGAAGACAGGCCAGTACT CAGGA 

^ AT GCT GGACTGCGC CAGGAGGAT CCT GGCCAGAGAGGGGGT GGC CGCCTTCTACAAAGGCTAT GT C CCCAACAT GCT G 

S'- '' G GCAT CAT CCC CTAT GC CGGCAT CGACCTT GCAGT CTACGAGACGCTCAAGAAT GCCT GGCTGCAGCACTATG CAGT G 

ffl AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
l[\ CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
f£ T TCAAACATAT CCT GC GGACC GAG GGGGCCT TCG GGCT GTACAGGGGGC T GGC C CC CAACTTCATGAAGGT CAT C CCA 

|l| GCT GT GAGCAT CAGCTACGTGGT CTACGAGAAC CT GAAGAT CACCCT GGGCGT GCAGTCGC GGTGACGG GGGGAGG GC 

ff§ CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
|si; GT CCTGCT GACCCCAGCAGACCCT CCT GTTGGTT CCAGCGAAGACCACAGGCATTC CTTAGGGT CCAG GGT CAGCAGG 

I CTCCGGGCTCACATGTGTAAGGACAGGACATTTTCTGCAGTGCCTGCCAATAGTGAGCTTGGAGCCTGGAGGCCGGCT 
; TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
13 CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
|f5 ATAAT CCAT GATGAAAGGT GAGGT CACGT GGCCT CC CAGGCCT GACTT C CCAACCTACAGCATT GAC GCCAACT TGGC 

TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
C$ CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
GGCCTGGACCCTGTCAGGATGGGCCCCACCTC^GAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
C CAT GTTT GAGGGC GAAGGGCAGAGC GTTTGTGT GT TCT GGG GAGG GAAGGAAAAGGT GTT GGAGGCCTTAATTAT GG 
fj ACT GTTGGGAAAAGGGT T TTGT CCAGAAGGACAAGCC GGACAAAT GAGCGACTT CT GT GCTT CCAGAGGAAGAC GAGG 

fll GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
5 - :;f: GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT 
AACT ATTTTTATAGATT T GTTTAATTAAT AGCTT GT CAT TTT CAAGTT CATTTTTTAT T CATATTTAT GTTCATGGTT 
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTTGGGCCGCTGCAGTCACATCT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTT GGGGAAGGG C TT G CC C CCAGCCT TAGGATTT CAGGGTTT GACT GGGGGCGTGGAGAGAGAGGGAG GAAC CT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAAT CTT CAATAGGAT GC AAAGATCAAT GCAAAAAT T GTTAT ATAT GAACATATAACT GGAGT C GT CAAAAAG 
CAAATTAAGAAAGAATT GGACGTTAGAAGTT GT CATTTAAAGCAGCCT T CTAATAAAGTTGT TT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQE FSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMT I DWNEWRDYHLLHPVENI PE 1 1 LYWKHST I FDVGENL TVPDE FTVEERQTGMWWRHL VAGGG 
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
I LAREGVAAFYKG YVPNMLG 1 1 P YAG I DLAVYE TLKNAWLQHYAVNS ADPGVFVXLACGTMS S TC 
GQLAS YPLALVRTRMQAQAS IEGAPEVTMS S L FKH I LRTE GAFGL YRGLAPNFMKVI PAVS I S YV 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE *q 



GGAAGGCAG C G GC AGC T CC AC T C AG CC AG T AC C C AGAT AC GC T GG GAACC T T C C C GAG CCATGG C 
T TCCCTGGGGCAGATCC TCTTCT GGAGCATAAT TAGCATCATCATTAT TCTGGCTGGAGCAAT TG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAr 
CGT GAT AC AAT GGCT GAAGGAAGGT GT TT TAGGCT TGG TCCATGAG T T CAAAGAAGGCAAAGAT G 
AGCT GTCGGAGCAGGATGAAATG T T C AGAGGCC GGACAGCAGT GT T T GC T GAT CAAGT GATAGT T 
!!; GGCAATGCC TC T T TGCGGC TGAAAAACGTGCAACT CACAGAT GCT GGCACCTACAAATGTTATAT 

JS CAT CAC T TC TAAAGGCAAGGGGAAT GC TAACCT TGAGTATAAAACT GGAGCC TT CAGCAT GCCGG 

|f| AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
III CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
Vf CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 

m 

TCAACAACACATACT CC TGTATGAT TGAAAAT GACAT TGC CAAAGCAACAGGGGATAT CAAAGT G 
* ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
Q CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
Iff GTGCCTTGGCCACAAAAAAGCATGCAAAGTCATTGTTACAACAGGGATCTACAGAACTATTTCAC 
CACCAGATATGACC TAGT T T TATAT TT CT GGGAGGAAAT GAAT T CATATCTAGAAGTCTGGAGT G 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATCTTCAAAGACATATTAGAAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGA 
GTGATAAGTASAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAAT TAAGC TGTAGTATGTACCCTAAGACGC T GCTAAT TGAC TGCCAC TTCGCAAC TCAGGGGC G 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT TCCCAAC TGACAAATGCCAAAGT TGAGAAAAATGAT CATAAT T T TAGCATAAACAGAGCAG T 
CGGGGACACCGAT T T T ATAAAT AAAC TGAGCACC T T CT T T T T AAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



t y 
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FIGURE 60 



MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 
YUTS KGKGNANLE YKTGAFSMPEVNVDYNAS S ETLRCEAPRW FPQP TVWAS QVDQGANFSEVS 
NTSFELNSENVTMKWSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL 

CVS S FFAI SWALLPLS PYLMLK 



Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoYlation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 6l 



TGACGTCAGAATCACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 

CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 

GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 

T GGAGGGC CCT ATGGACACCCCAAT CC TGGGAT GTT CCCC TCT GGAAC TCCAGGAGGACCATAT G 

GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 

GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 

GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA 

ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA 

GGCCGCATCGATGTCTACGGCTTGTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 

CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC 

AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 

GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 

GGCCT TCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCT CAGCTTCGAGGACT TCGTCA 

CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 

TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 

ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 

AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA 

ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 

AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 

GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 

CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 

CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 

CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 

T CAGT C TCCAGGAGACAGTGGTCACC T C TCCCTGCCAATAC T T T T T T TAAT T TGCATT T T TT T T C 

AT T TGGGGCCAAAAGT CCAGT GAAAT TGTAAGCTT CAATAAAAGGAT GAAAC T CT GA 
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FIGURE 62 



MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGY I SMKELKQALVNCNWS S FNDETCLMMINMFDKTKS GRI DVYGFSALWKF I QQWKNL FQQYDR 
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DT AVQGN I RL S FED FVTMT AS RML 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

amino acids 147-150 

Casein kinase II phosphorylation sites. 

amino acids 135-138, 150-153, 202-205, 271-274 

N-myristoylation sites. 

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 
66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 

CAGGATGCAGGGCCGCGTGGCAGG6AGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAA^lCTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG 
CAGCT GGGGAAAT AT TAAT CGGTAT CCAGGAGGCAGCTGGGGGAAT AT TAAT CGGTAT CCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
T GCT CATC TCCAAT AAAATAAAAGCAC TTAT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA?\AAAAAAAAAAAAAAAAAAAAA?^^ 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Casein kinase II phosphorylation sites . 

amino acids 56-59, 155-158 

N-myristoylation sites . 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 6* 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC 
TGGGCTGCCCCTTGTCCTCCTGTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
C TGAGTGGCAATAAATAAAAT T CGGT ATGC T G 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



Important features: 
Signal peptide: 

amino acids 1-19 

Q N-glycosylation site. 

amino acids 37-41 

in 
iy 

in N-myristoylation sites. 

m amino acids 15-21, 19-25, 60-66 

u 
Q 

m 

rij 
o 
III 
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FIGURE 67 

ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC TC T GCCCC TGAGGT CAAGAGAGGAT GGGGCTAT TC ACTT T TATATAT T XATATAAAAT TAG 
TAGTGAGATGTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 

MMPGLGLLLALGLPFLLARWGRA^ 

LLAALLIAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 



Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 5 6-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and c(34P-dependent protein kinase phosphorylation site. 

amino acids 8 6-90 

Tyrosine kinase phosphorylation site. 

amino acids 8 6-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 

GC C AG GAATAACT AGAGAGGAACAATGGGGTTATT CAGAGGT T TT GTTT T CCT CT TAGTT CTGTGC CT GCT GCACCAG 
T CAAATACTT CCTT CAT TAAGCT GAATAAT AAT GGCTT TGAAGATATT GT CATTGTTATAGAT CCTAGTGT GC CAGAA 
GATGAAAAAATAATTGAACAAATAGAGGATATGGTGACTACAGCTTCTACGTACCTGTTTGAAGCCACAGAAAAAAGA 
TTTTTTTTCAAAAAT GTATCTATATTAATT CCT GAGAATT GGAAGGAAAAT CCTCAGTACAAAAGG CCAAAACAT GAA 
AAC CATAAACAT GCT GAT GTT ATAGT T GCACGACCTACACT CC CAG GT AGAGATGAACCATACACCAAGCAGT TCACA 
GAATGT GGAGAGAAAGGC GAATACATT CACTT CAC CCCT GACCTT CTACTT GGAAAAAAACAAAAT GAAT ATGGACCA 
CCAGGCAAACT GTTT GT C CAT GAGT GGGCT CAC CT CCG GT GGGGAGTGT TT GAT GAGTACAAT GAAGAT CAGCCTT T C 
TAC CGT GCT AAGT CAAAAAAAAT CGAAGCAACAAGGT GT T C CGCAGGT AT CT CT GGTAGAAAT AGAGTT TATAAGT GT 
CAAGGAGGCAGCT GT CTT AGTAGAGCAT GCAGAAT T GAT TCTACAACAAAACT GT AT GGAAAAGATTGT CAATT CTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT C CAT AAT GTT T AT GCAAAGTATT GAT T CTGT T GTT GAATTT T GTAACGAA 
AAAAC CCATAAT CAAGAAGCT CCAAGCCTACAAAACATAAAGT GCAATT TTAGAAGTACAT GGGAGGT GAT TAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA 
AGAAT T GT GT GCTTAGTT CTT GATAAGT CT GGAAGCAT G GGGG GTAAGGAC CGCCTAAAT C GAAT GAAT C AAG CAGCA 
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA 
AATAAGCTAAT CCAAATAAAAAGCAGTGAT GAAAGAAACACACT CATGGCAGGATTACCTACATAT CCT CT GGGAGGA 
ACTTC CAT CT GCT CT GGAATTAAATATGCATTT CAGGT GAT TGGAGAGCTACAT T CCCAACT C GAT GGAT CCGAAGTA 
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT 
TTT ATT GCTT T GGGAAGAGCT GCT GATGAAGCAGT AATAGAGAT GAGCAAGATAACAGGAGGAAGT CAT TT TTAT GTT 
T CAGAT GAAGCT CAGAACAAT GGC CT CATT GAT GCTTT T GGGGCTCTTACATCAGGAAATACT GATCT CTCCCAGAAG 
T CC CTT CAGCTCGAAAGTAAGGGATT AACACTGAAT AGTAATGCCT G GATGAACGACACT GTCATAATT GATAGTACA 
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA 
AT AAT GGAAAATTTCACAGT GGAT GCAACTT CCAAAAT GGCCTATCT CAGTAT T CCAGGAACT GCAAAGGT G GGCACT 
T GG GC ATACAAT CTT CAAGCCAAAG C GAACCCAGAAACATTAACTAT T ACAGT AACT T CTCGAG CAGCAAATT CTT CT 
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA 
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT 
TTGGAACTTT TGGATAAT GGT GCAGGCGCT GATT CTTT CAAGAATGATGGAGT CTACT C CAGGTATTTT ACAGCATAT 
ACAGAAAAT GGCAGATAT AGCTTAAAAGT T C GGG CT CAT G GAGGAGCAAACACT GCC AGGCTAAAATTAC GGCCT C CA 
CT GAAT AGAGCCGC GTACATAC CAGGCT GGGTAGT GAACGGGGAAATT GAAGCAAAC C CGCCAAGACCT GAAATTGAT 
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC 
CTT CCCTT GCCTGACC AATACCCACCAAGT CAAATCACAGACCT T GAT GCCAC AGTT CAT GAGGATAAGATTAT TCTT 
ACAT GGACAGCACCAGGAGAT AATTT T GATGTT GGAAAAGTT CAAC GTTATAT CATAAGAATAAGTGCAAGTATTCTT 
GATCTAAGAGAC1AGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
AGCTTTGCATT TAAAC CAGAAAATAT CTCAGAAGAAAAT GCAACCCACATAT TTAT T GCCATTAAAAGTATAGATAAA 
AGCAATTTGAC^TCAAAAGTATCCAACATTGCACAAGTAACTTT GTTTATCCCTCAAGCAAAT CCT GATGACATTGAT 
C CTACAC CTACT C C TACT CCTACT CCTAC T CCT GATAAAAGT CATAATT CT GGAGT T AATAT T T CTAC GCT GGT AT T G 
TCTGTGATTGGGTCTGTTGTAATTGTTAACTTTATTTTAAGTACCACC^TTTGAACCTTAACGAAGAAAAAAATCTTC 
AAGTAGACCT AGAAGAGAGTT TT AAAAAACAAAACAAT GTAAGTAAAGGATATTT CT GAAT CTT AAAATTCATC CC AT 
GTGTGATC^TAAACTCATAAAAATAATTTTAAGATGTCGGAAAAGGATACTTTGATTAAATAAAAACACTCATGGATA 
TGTAAAAACTGTCAAGATTAAAATTTAATAGTTTCATTTATTTGTTATTTTATTTGTAAGAAATAGTGATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAAC^GTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTAT CTGAGTAGT CAAAATAC AAGT AAAGGAGAGCAAATAAACAACAT TT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 7Q 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG 
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL 
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR 
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTASGGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRYI IRI SAS ILDLRDS FDDALQVNTTDLSPKEANSKES FAFKPENI SEENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



CT CCT TAGGT GGAAACCCTGGGAGTAGAGT ACTGACAGCAAAGACCGGGAAAGAC CATACGTC CCC GGGCAGGG GT GA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGCAAT GAT GTAG CCACGT CCTAACCT TC CCT TC T T GAACCCCCAGTTATG CCAGGATTTACTAGAGAGT GT CA 
ACTCAACCAGCAA.GCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGT GACTT GCCT GAGGGTGGAC CAGAAGAAAG GAAAG GT CC C CT CTT GCT GTTGGCT GCAC ATCAGGAA 
GGCT GT GAT GGGAAT GAAGGT GAAAACT T GGAGATTT CACTTCAGT CATT GCTT C T GC CT GCAAGATCAT CCT T T AAA 
AGTAGAGAAGCTGCT CT GT GT GGT GGTTAACTC CAAGAGGCAGAACT C GTT CT AGAAGGAAAT GGATGCAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CTGGCACT GCCCAGGGCCAACAGC CC CAC GGGGAAGGAGGGGTACCAGGC C GT CCT T CAGGAGT GGGAGGAGCAGCAC 
U C GCAACTAC GTGAGCAGCCT GAAGCGGCAGAT CGCACAGCT CAAGGAGGAGCT GCAGGAGAGGAGT GAGCAGCT CAGG 

l Si . AAT GGGCAGTACCAAGCCAGCGAT GCT G CT GGC CTGGGT CT GGACAGGAGCCC CC CAGAGAAAACCCAGGCCGACCT C 

-%J CT GG CCTT C CT GCACT CG CAGGT G GACAAGGCAGAGGT GAAT GCTGGCGTCAAGCT GGC CACAGAGTAT GCAGCAGT G 

fll C CTTT CGATAGCTTTACT CTACAGAAGGT GTAC CAGCT GGAGACT GGC CTTACCCGC CACC CC GAGGAGAAGCCTGT G 

AGGAAG GACAAGCGGGATGAGTTGGTGGAAGCCATTGAATCAGCCTTGGAGACCCTGAACAATCCTGCAGAGAACAGC 
V" ccc ^ TCACCGTC CTTACACGGCCTCTGATTTCATAGAAGGGATCTACCGAACAGAAAGGGACAAAGGGACATTGTAT 
III GAGCTCACCTTCAAAGGGGACCACAAACACGAATTCAAACGGCTCATCTTATTTCGACCATTCAGCCCC^ 
5 S3 GTGAAAAAT GAAAAGCT CAACAT GGCCAACAC GCTTAT CAAT GTTAT C GTGCCT CTAGCAAAAAGGGT GGACAAGTT C 

Wfe CGGCAGTT CAT GCAGAATT TCAGG GAGAT GT GCATT GAGCAGGAT GGGAGAGT CCAT CT CACT GTT GTT TACTT TGGG 

fl! AAAGAAGAAATAAATGAAGT CAAAGGAATACT T GAAAACACTT C CAAAGCT GC CAACT T CAGGAACTT TACCT T CAT C 

h ; CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TT T TTCT GT GAT GT GGACAT CT ACTT CACAT CT GAATT CCT CAATACGT GT AGGCT GAATACACAGCCAGGGAAGAAG 
4S GTATTT TAT CCAGTTCT TTTCAGT CAGT ACAAT CCT GGCATAAT AT AC GGC CACCAT GATGCAGT CCCT CCCTT GGAA 

CAGCAGCT GGTCATAAAGAAGGAAACT GGATT TT GGAGAGACT TT G GATTT GGGAT GAC GT GTCAGT AT CGGT CAGAC 
J% TTCATCAATATAGGTGGGTT TGAT CT GGACAT CAAAGGCT G GGGCGGAGAGGAT GT G CACCTTTATCGCAAGTATCT C 

Us CACAGCAACCTCATAGT GGTAC GGACGC CT GT GCGAGGACT CTT CCAC CTCT GGCAT GAGAAGC GCT GCAT GGACGAG 

f*% CT GACCC CCGAGCAGT ACAAGAT GTGCAT GCAGT CCAAG GCCAT GAAC GAGGCATCC CACGGCCAGCT GGGCAT GCT G 

%^ GT GT TCAGGCACGAGATAGAGGCT CAC CTT C GCAAACAGAAACAGAAGACAAGT AGCAAAAAAACATGAACT CC CAGA 

fll GAAGGAT T GTGGGAGACACTT TTT CT TTCCT TTT GCAAT TACTGAAAGT GGCTGCAACAGAGAAAAGACT T CCAT AAA 

A G GACGACAAAAGAATT GGACT GAT GGGTCAGAGAT GAGAAAG CCT C CGATTT CT CT CT GTT GGGCT TTTT ACAACAGA 

AAT CAAAAT CT CC GCTT T GCCT GCAAAAGTAACCCAGTT GCACC CT GTGAAGTGT CT GACAAAGGCAGAAT GCTT GT G 
f|I AGAT TATAAGCCTAAT GGT GT GGAGGTTTT GATGGT GTTT ACAAT ACACT GAGACCT GTT GT TTT GT GT GCT CATT GA 

AATATT CAT GAT TTAAGAGCAGTTT TGTAAAAAATT CAT TAGCAT GAAAG GCAAGCAT ATT T CT CCT CATAT GAAT GA 
GCCTAT CAGCAGGGCT CTAGTT TCTAG GAAT GCTAAAAT AT CAGAAGGCAGGAGAG GAGAT AGGCTTAT TAT GATACT 
AGTGAGTACATTAAGTAAAATAAAAT GGACCA.GAAAAGAAAAGAAACCAT AAAT AT CGT GT CAT ATT TT C CC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TT TT TT CCCTT GTGAGT TATAGTCT GCTTAT T TAAT TACCACTTT GCAAGCCTT ACAAGAGAGCACAAGT T G G CCT AC 
AT TTTTATATTTTTTAAGAAGATACT TTGAGATGCATTAT GAGAACTTT CAGTT CAAAGCAT CAAATT GATGCCAT AT 
C CAAGGACAT GC CAAAT GCT GATT CT GTCAGGCACT GAAT GT CAGG CATT GAGACATAGGGAAGGAATGGTTTGT ACT 
AATACAGAC GTACAGATAC TTT CT C T GAAGAGTATTT TCGAAGAGGAGCAACTGAACACT GGAGGAAAAGAAAAT GAC 
ACTT T CT GCTTTACAGAAAAG GAAACT CAT TCAGACT GGT GATAT C GTGAT GTAC CTAAAAGT C AGAAACCACAT TTT 
CTCCTCAGAAGTAGGGACCGCTTTCTTACCTGTTTAAATAAACCAAAGTATACCGTGTGAACCAAACAATCTCTTTTC 
AAAACAGGGT GCTC CT C CT GGCTT CT GGCT T C CATAAGAAGAAAT GGAGAAAAATAT ATAT AT ATATATATAT ATTGT 
GAAAGAT CAAT CCAT CTGCCAGAATCTAGT GGGAT GGAAGTTTTTGCTACAT GTTAT CCACCCCAGGC CAGGT GGAAG 
TAACT GAATTATTT TTTAAATTAAGCAGTT CTACT CAAT CAC CAAGAT GCTT CTGAAAATTGCATTTT AT TACCATTT 
CAAACT ATTT TTTAAAAAT AAATACAGTTAACATAGAGT GGTT T CTT CATT CAT GT GAAAAT TAT T AGCCAGCACCAG 
AT GCAT GAGCT AATTAT CT CTTT GAGT CC T TG CT T CT GTTT GCT CACAGTAAACT CAT T GTT T AAAAGCTTCAAGAAC 
AT TCAAG CT GTT GGT GT GT TAAAAAAT GCATT GTATT GATTT GTACT GGTAGTTT AT GAAATTTAAT TAAAACACAGG 
CCAT GAAT GGAAGGT GGTATTGCACAGC T AATAAAATAT GAT TT GTGGATAT GAA 
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FIGURE 72 



MMMVRRGLLAWI SRVVVLLVLLCCAI SVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE 
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK 
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA 
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN 

TLINVIVPLAKRVDKFRQFMQNFR^ 

T FI QLNGE FSRGKGLDVGARFWKGSNVLL FFCDVD I YFT SE FLNTCRLNTQPGKKVFYPVLFS QY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIWRTPTOGLFHLWHEKRCMDELTPEQYKMCMQSKAiyLNEASHGQLGMLVFRHEIEAHL 

RKQKQKT S SKKT 

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328 

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
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FIGURE 7a 



GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGATTTGTCCTGGGGATCCA 

GAAACCCATGATACCCTACTGAACACCGAATCCCCTGGAAGCCCACAGAGACAGAGACAGCAAGA 

GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 

CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 

CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 

GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 

GGAAACAAT GCCCAGT CGCCCATCGATAT TCAGACAGACAGTGT GACAT TT GACCCTGATT TGCC 

TGCT CTGCAGCC CCACGGAT AT GACCAGC C T GGCACCGAGCCT T T GGAC C T GCACAACAATGGCC 

ACACAGTGCAACTCTGTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 

GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 

TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG 

AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 

AATATAGCT TATGAACACATTCTGAGTCACTTGCATGAAGT CAGGCATAAAGATCAGAAGACCTC 

AGTGC C TCCC T T CAACCTAAGAGAGCT GCT CCCCAAACAGCT GGGGCAGTACT T C CGCTACAATG 

GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 

AT TTCAATGGAACAGCTGGAAAAGCT TCAGGGGACATT GTT CTCCACAGAAGAGGAGCCCTCTAA 

GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 

TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 

TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 

CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 

CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 

CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 

GGACCCAGGCTGTCAT TCCAGGAAGAAC TGCAGAGCCT TCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TC C T T AGGATAAAGAGT T GCTGT T GAAG TTGT AT AT T T T TGAT CAATATAT T T G GAAAT TAAAG 
TTTCTGACTTT 



FIGURE 74 



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QN YRALQP LNQRMVFAS F I QAG S S YT T GEML SLGVG I LVGCLCLLLAVYF I ARKI RKKRLENRKS 
WFTSAQATTEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryo tic- type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 
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FIGURE 75 

TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 

ATGGACTCCACCAGAGGGTCTTCAAGGAGT TAAAGTTACTTACAC TGTGCAGTAT TTCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCC TCGCCGT GCT CAGC C T T CTGAGAAGCAG T GT GCCAGGAC T TT GAAAGAT CAATCAT 
CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
TCTGTGATGGGCTATTCCATCTACCGATATATCCACGTTGGCAAAGAGAAACACCCAGCAAATTT 
GAT T T T GAT T TAT GG AAAT G AAT T T G ACAAAAG AT TCTTTGTGCCTGCT GAAAAAAT C G T GAT TA 
AC T T TATCACCCT CAATAT CTCGGATGAT TC TAAAAT T TCTCAT CAGGATAT GAGT TT ACT GGGA 
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 
GGAAGAGGAGGT GAAACAT T T AGGGTATGC T T CGCAT T TGAT GGAAATT T T T T GT GACT C T GAAG 
AAAACACGGAAGGTACTTCTCTCACCCAGCAAGAGTCCCTCAGCAGAACAATACCCCCGGATAAA 
ACAGT CAT T GAATATGAATATGAT GT CAGAACCACTGACAT T TGT GCGGGGCCTGAAGAGCAGGA 
GCTCAGT TT GCAGGAGGAGGTGTCCACACAAGGAACAT TAT T GGAGTCGCAGGCAGCGT T GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTATCTCATGCAATTCATGGAGGAATGGGGGTTATA 
TGTGCAGATGGAAAAC TGA TGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAG TCAATAAAT GT T TGCCAGACTGGGTGCAGAAT T TAT T CAGGTGGGTGT 
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FIGURE 76 



MS YNGLHQRVFKELKLLTLCS I S SQIGPPEVALTTDEKS I S WLTAPEKWKRNPEDLPVSMQQI Y 
SNLKYNVSVLNTKSNRTWSQCVTOTTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 



GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCTCAGTCCCACTGCAGTTCTCATCCTCGGCCCAAAAGTCATCAAAGAAAAGCTGAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
O T CGTGGAGT TCCACAT GACGAC TGAGGCCCAAGCCACCAT CCGCATGGACACCAGT GCAAGT GGC 

W CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 

in 

Tl TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 

w 

iff CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
|f| CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
H TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
sjjj TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
^ ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
f|J TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
D ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
H * TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
T TTACACCAAAGGTGACCAACTTATACTCAACTT GAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGAT TGGCT GGTTCCAACCTGAT GT TCTGAAAAACATCATCACT GAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 

MAGPWTFTLLCGLI^ATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKH 1 1 WLKVI TANI LQLQVKPSANDQELLVKI PLDMVAGFNT PLVKT I VE 
FHMT TEAQAT I RMDT S ASGPTRLVLSDCAT SHGS LRI QLLYKLS FLVNALAKQVMNLLVPSLPNL 
VKNQLCP VIEAS FNGMYADLLQLVKVP ISLSI DRLE FDLLYPAIKGDT I QL YLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
^ KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG 
FEAAE S S L TKDALVL T PAS LWKP S S PVSQ 

yj Important features of the protein: 

¥J Signal peptide: 

,7." amino acids 1-21 

Q N-glycosylation sites. 

Ifl amino acids 48-51, 264-267, 401-404 

IJf Glycosaminoglycan attachment site. 

fU amino acids 412-415 

% "SiP 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 



GAGAGAAGTCAGCCT GGCAGAGAGACTC T GAAAT GAGGGAT TAGAGGT GT TCAAGGAGCAAGAGC 

TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 

CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 

GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 

CCAAGGGCC TCT GGAT GGAAT GTGCCACACACAGCACAGGCATCACCCAGT GTGACAT CT AT AGC 

ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 

CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 

GAGCCAAAGACAGAGTGGCGGT AGCAGGTGGAGTCT T T T T CAT CCT T GGAGGCCT CCT GGGAT TC 

ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAAGTACTACGATGGCTAC 

CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 

GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 

TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTGTTAGCTCATT 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 

CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 

GAG TAAT T T GT GCATG AAC T GAAAT AAAACCAT CC TACGGTATCCAGGGAACAGAAAGCAGGATG 

C AGGAT G GGAG GACAG GAAG G C AGC C T GGGAC AT T TAAAAAAAT A 
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FIGURE 80 



MAS LGLQLVGYI LGLLGLLGTLVAMLLPSWKTS S YVGAS I VTAVG FSKGLWMECATHS TG I TQCD 
I YSTLLGLPADI QAAQAMMVTS SAIS SLAC I ISWGMRCTVFCQESRAKDRVAVAGGVFFILGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 



m 



Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 8l 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
-J"! ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
ff| GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
id TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
Ml AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 
f! ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC 
* AC TGC C AGAC TCCTGCTGC C AAGC C T GCAAAGAT GAG G C AAG T GAGC AAT C G GAT GAAGAG GACA 

O GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
Ifl AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
W CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGATCGTCCTGAAGGAGAAACATAAGAAAGCCT 
P GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
fH CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
CTCGGACTTGGTGGAGATCTACCTCTGGAAGCTGGTAAAAGATGAGGAAACTGAGGCTCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAA CAGTTGCAGATATGAGCTGTATAAT TGTTGT TATTATATAT TAATAAA 
TAAGAAGT T GCAT T ACCCTCAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGIALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S STRCPKAPGRVLVBTSVSPSPDNLRRFALEHEASDLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 



GACAGC TGTGTCT CGATGGAGT AGACTCTC AGAACAGCGC AGTT TGCCCT CC GCT CACGC AG AGCCTC TCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACG CAGCAT TC T CCT GT TTCCTGT CT C CT AAGACC AATGCAGAGGCC AT GGAAGT GCGGTT CTT C AGGGG 
CC AGTT CTCT AGCGT GGT CCACCT CT AC AGGGAC GGGAAGGACC AGCC ATTT AT GC AGAT GCC AC AGT ATC 
AAGGCAGGAC AAAACT GGTGAAGGAT TCT AT TGCGGAGGG GCGC AT CT CT CT GAGG CT GG AAAAC AT TACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 

AC AGGT GTCAGCACTGGGCTCAGT TCCT CT CATT TC CATC AC GG GAT ATGT T GAT AGAGACAT C CAGCT AC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 

AC AGACT C C AGGAC AAAC AGAGAC ATGC AT GGCCTGT TTGAT GT GG AG ATCT CT CTGACCGT CC AAGAGAA 
CGCC GG GAGC AT AT C CT GTTCCAT GCGGCATGCT CATCTGAGCCGAGAGGT G GAATCC AGGGT AC AGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAG AAAG C ACGGACAGGCAGAAT TGAGAGACGC CCGGAAACAC GC AGTGGAGGT G ACTCT G GATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTC ACTCTG AGAAGAGATT T ACAAGGAAGAGT GTGGTGGCT TCT C AG AGTT TCC AAGCAGGGAAAC ATT A 
CTGGGAGGTGGACGGAGG ACAC AAT AAAAGGTGG CGCGTGGG AGT GTG CCGGGATGATGT GGACAGGAGGA 
AGGAGT AC GT GACT TT GTCT CCCGAT CATGGGT ACT GGGT CCTC AGAC TGAATGGAGAAC ATTT GT ATTT C 
ACAT T AAATCCCCGTT T TAT C AGCGT CT T CCCCAGGACCC CACCT ACAAAAATAGGGGTCT TCCTGGACT A 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CC AGTCAC CC AGGAAT CAGAGAAAGAGGCCT C TT GGCAAAGGGCCT CT GC AATCCC AG AG AC AAGC AACAG 
TGAGTCCT CCT C AC AGGCAACC ACGC CCTTCC TCCCCAGGGGT GAAATG TAGGATGAATC AC AT CC C ACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 
CT GAC AT T ACAT T T AGT TT GCT CTC ACT C CAT CT GGCT AAGT GAT CT T GAAATACC ACCTCTCAGGT GAAG 
AACCGT C AGGAAT T CCCATCTC ACAGGC T GT GGT GT AG AT TAAGT AGACAAGGAAT GT GAAT AATGCTT AG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 
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FIGURE 84 



MAIMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEiyMEVRFFRGQFSSVVH 
L YRDGKDQPFMQMPQYQGRTKLVKDS IAEGRI SLRLENI TVLDAGLYGCRI S SQS YYQKAIWELQ 
VSALGSVPL I S I TGYVDRDIQLLCQS SGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVE I SL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT 
PIVICPVTQESEKEASWQRASAI PETSNSESSSQATTPFLPRGEM 

Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 239-255 
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FIGURE 8* 

AACAGACGTTCCCTCGCGGeCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 

GCT C TGGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACT GC TGACGATGCAGAGT TCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGAC CCACATACCAAGAAT TGCACCC TGAGCAT CAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 

TACTTCTTTCGTATGGAGAAAGGAAGTATAAAATGGAATTATAAACATCACGGGCTCTCTGTGAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

CCACAGCCCTG.GTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCTTGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATCCACAG 

ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 

GAGGCTGAT TCTT GT AGAATTAACAGCCCTCAACGTGAT GAGCTATGATAACACTAT GAAT TATG 

T GC AGAGT GAAAAGCAC ACAGGC T T TAGAGTCAAAG TAT C T CAAACC T GAATC CACAC T G T GCCC 

TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 
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FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPWHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKAT S GVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTG IE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
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FIGURE 87 



AGAAAGCT GCACT CT GT TGAGC T CCAGGGCGCAGTGGAGGGAGGGAGT GAAGGAGCT C T CT GTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGATTACAATGAACCAACTCAGCTTCCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
U GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
O CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
tf ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
J; ;| AGAAACAGC TCCCTGCT GAGGTACCGCACGGACACT GGC T T CC TCCAGACAC TGGGACATAAT CT 

ff* GT T TGGCAT CTACCAGAAATAT CCAGT GAAATAT GGAGAAGGAAAGTGT TGGACT GACAACGGCC 

Hi CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
N 5 GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
5 CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG 

?Z GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 

111 

GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
HI T TGAGAG TTTTGTGG GAG G G AAC C CAG AC CTCTCCTCC C AAC CAT GAG AT C C C AAG GAT G G AG AA 

Q CAACT TACCCAGTAGCTAGAATGTTAATGGCAGAAGAGAAAACAATAAATCATAT TGACTCAAGA 

iU AAAAAA 
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FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVI PWYDFGDAQKTAS YYSPYGQRE FTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site. 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82, . 115-121, 124-130, 253-259, 292-298 
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FIGURE 8Q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 

T TAAGAC T TATATACAGT T T TAGGGGACAAT TAAAAAAAAAAAA 
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FIGURE QO 

MGRVSGLVPSRFLTLIAHLWVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

M, 
C3 
Q 
111 
W 

m 

CP 
u 

o 
in 

o 

m 

Q 

ru 
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FIGURE Ql 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGT GAAGAAAAT C T TT GGC T GGGGCGAC T TC TAC T CCAACAT CAAGACGG TG 
jU GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
Q CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
y AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
*[I GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
III CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
CP T C T ACATCGCCT TCTACAGCACGGAC TAT CGGCTGGT CCAGAAGGTGTGCCCAGATT ACAACTAC 

CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
jL : TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
III AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
Q AGAAAGGGT CCCAAGT GCT GGTCCCAACCT GAAGCT GT GGAGTGACTAGATCACAGGAGCAC T GG 

III AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
C C T T GG T T C T T G C CAT CC T GAG GAAAGAT AGCAAC AGG GAG GGGGAGAT T T CAT CAG T G T GGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCT TAAGCTAAGACAGGACGAT T GTGGT CCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
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FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SV1IFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
ATGT T CCGAT T T GCAT ATT AT AAAC TC T TAAAAAAAGCCAGTGAAGGT T TGAAGAGTAT AAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACT TCCCAAACCGCAGACTACAT CTT TAGAGGAAGCACAACTGTGCCT 
T T T T C T GAAAAT CCCT TT T T C TGGTGGAAT T GAGAAAGAAATAAAACTATGCAGATA 
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FIGURE Q4 

MTAAVFFGCAFI AFGPALALYVFT I AI E PLR 1 1 FL I AGAFFWL VSLL I S SLVWFMARVI I DNKDG 
PTQKYLLI FGAFVSVYI QEMFRFAYYKLLKKASEGLKS I NPGE TAPSMRLLAYVS GLGFG IMS GV 
FS FVNTLS DS LGPGTVG I HGDS PQFFLY S AFMTLVT I LLHVFWG I VFFDGCEKKKWG I LL I VLLT 
HLLVSAQTFI SSYYGINLASAFI ILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 ' 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE 



AAT T T T T CACCAGAGTAAACT TGAGAAACCAACT GGACC T T GAG TAT T GTACAT TTTGCCTCGTG 
GACCCAAAGGT AGCAATCTGAAACATGAGGAGTACGAT TCTAC TGT T TTGTC TTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC TACCAAACCAACAGCAGTCAAATCAGG TCTTTCCTTC T TTAAGT C TGATACCATT AACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATT GACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGT TAC 

C AAT T T TT G T CACAGAAC T T GGAGCCCAGGGCAC TAT C CT AAGC T CAGAGGAAT TGCC ACAAAT C 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 

TAATCCAGATGT CCAGGAT GGAAGCC T TCCAGCAGGAGGAGCAGGTGTAAAT CCT GCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
G TAAGC T G T T T C AAAT T T T TT CAAC T AAGC T G C C T C G AAT T T GG T GAT ACAT GT G AAT C T T TAT C 
AT TGATTATAT TATGGAATAGATT GAGACACAT TGGATAGT CT TAGAAGAAAT TAAT T C T T AAT T 
TACCT GAAAAT AT TCT T GAAAT T T CAGAAAAT AT GTT CT ATGTAGAGAATCCCAAC TT T T AAAAA 
CAATAATTCAATGGATAAATCTGTCTTTGAAATATAACATTATGCTGCCTGGATGATATGCATAT 
TAAAACATATTTGGAAAACTGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAA.GMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGWPATQGTPAGRLPTPSG 
TDDDFAVT TPAG I QRS THAIEEATTES ANG IQ 



Signal peptide: 

M: amino acids 1-16 

o 

€3 

m 
w 
m 
m 

H 

o 

II! 

£3 
III 

c:s 
rii 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGAGTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCT ACAGGTCACCCT GGAGAT GCAGGAT GGACATGTC TT G T GGGGT CCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
J.J, GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 
**% CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 
I;! ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA 
\zf CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT 
IP CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA 
III CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 
Iff T GCAGAGGGAAAC C T C T AC GT GAC C AGAGAG C T GG ACAGAGAAGC C CAGGC T GAG T AC C T G C T CC 

Sf AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 
f 8 ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 
H CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA 
5 ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 
Q CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 
|JJ CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 
CI ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 
f|| CC TCGAGC CCGCCT T CCGCC TCAT GGATT T T GC CAT T GAGAGGGGAGACACAGAAGGGAC T T T TG 

H GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG 
p| GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 
lU AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAT^ATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACC T C TGCACAC CCCGCCAAGACCATGGC T TGAT CG T GAGT GGACCCAGCAAGGACCCCGAT CT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
AT T T T C AC C C AC T GG AC CAT G T C AAG GAAGAAG GAC C C G GAT CAAC CAG C AG AC AG CGTGCCCCT 
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 
TCCCC TGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCT CCA 
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAA?WiAAAAAA?\AAAAAAAAG 
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FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 

EGPFi^DPDSGFLLVTRJ^DREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 

YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGftLALSPKG 

S T S LDHALERT YQLL VQVKDMGDQAS GHQAT AT VE VS 1 1 E S TWVS LE P I HLAENLKVLY PHHMAQ 

VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 

VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHWYQLLSPEPEDGVEGRA 

FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 

PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 

EAAPSHEVVVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQES YEASVPI SAPAGS FLLT 

IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 

QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 

REHI IPVWSHNAQMWQLLVRVIVCRCNVEGQ 

L I FTHW TMS RKKDPDQPADS VPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGGACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
AT CTAGGACCCAGGCATC T TGC T T T CCAGCCACAAAGAGACAGATGAAGATGCAGAAAGGAAAT G 
T T C TCCT TATGT T TGGT C T ACTAT T GCAT T TAGAAGCT GCAACAAAT T CCAATGAGAC TAGCACC 
T C TGC CAACACT GGAT CCAGTGTGATC TCCAGT GGAGCCAGCACAGCCACCAAC T CTGGGTQCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAACTCTGAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCTGAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAAC T CT GAGTCCAGCACACCC T CCAGTGGGGCCAGCACAG TCACCAAC T 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAAC T C TGAC T CCAGCACAACC TC CAGTGGGGC TAGCACAGCCACCAAC TC TGAG T CCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAACTCT GAGT CCAGCACAACC TCCAGT GGGGCCAGCACAGCCACCAACT CT GAGT CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT 
C T GAGTCCAGCACAGTGTCCAGT GGGAT CAGCACAGT CACCAAT TC TGAGTCCAGCACACCC T CC 
AGT GGGGCCAACACAGCCACCAAC TC TGAGT CCAGTACGACC TCCAGTGGGGCCAACACAGC CAC 
CAAC T C TGAG T CCAGCACAGT GT CCAGT GGGGCCAGCACTGC CAC CAAC TCT GAGT CCAGCACAA 
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA 
GCCACCAACTCT GACT CCAGCACAACC T CCAGTGAGGCCAGC ACAGCC AC CAAC T C T GAGT CTAG 
CACAG TGT CCAGTGGGAT CAGCACAGT CACCAAT TC TGAG TCCAGCACAACC TCCAGT GGGGCCA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TC T TC T TCT GT GT GAGAAACAGCC TGT CCC TGAGAAACACCT T TAACACAGC T GT C TACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GT GGAGT CC TAACT GGT TCT GGAGGAGACCAGT AT CAT CGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAG T GCC GCAT TC T TCAGGAAGGAAGAGACC T GGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT C T T GAAGAAG G TAT TCCT CAC CTTTCTTGCCTT T AC C AGACAC T G GAAAGAGAAT AC TAT AT 
TGCTCAT T TAGCTAAGAAAT7VAATACATC TCATC TAACACACACGACAAAGAGAAGC TGT GCT TG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CATCTGGCATTCAAAATCTCCACAGTAAAATCCAAAGACCTCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE loo 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GAS T VTNS G S SVT S SGAS TATNS ESS TVS S RAS TATNSES S T L S S GAS TATNSDS S TT S S GAS TA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGAS TATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
S TTS S GANTATNSGS SVT S AGSGTAALTGMHT TSHSAS TAVSEAKPGGS LVPWE I FL I TLVS WA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 

AMEMSGRNFSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
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FIGURE 1Q1 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGC T GGAAC TAT C AGGC T GAAAAACAGAGT GGGT ACT C T C T TC TGGGAAGCT GGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAAGAAC 
f*i CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
□ GATTATAACTGGGGATGCAT TTGTTCCTGGAGAAAGAAGTGTCAT TATCATGAACCATCGGACAA 

CO GAAT GGAC TGGATGT T CC TG TGGAAT T GCCTGAT GCGAT AT AGC T ACC T CAGAT T GGAGAAAAT T 

Jjj TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
C T TCAT T CATAGGAAAT GGAAGGAT GACAAGAGCCAT T T CGAAGACAT GAT T GAT TACT T T T G TG 
\& ATAT TCACGAACCACT T CAACTCC TCATAT T CCCAGAAGGGAC TGATCT CACAGAAAACAGCAAG 

a T C T C G AAG T AAT GCATTTGCT GAAAAAAAT G G AC T T CAG AAAT AT GAAT AT G T T T T AC AT C C AAG 

O AACTACAGGCTT TACT TT TGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGAT GCTGT CCATG 

J| ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
||| CCCAGGGAAAT CCACTT TCACGTCCACCGGTATCCAATAGACACCC TCCCCACATCCAAGGAGGA 

Q CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
fU GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATT T GTACAGT CTTGT TAAGTGGTAT T T TATAAT CAC CAT T GTAATCT T TGT GCT GCAAGAGA 
GAATAT T TGGT GGAC T GGAGAT CAT AGAACT T GCAT GT T ACCGAC T T T TACACAAAC AGCC AC AT 
T TAAAT TCAAAGAAAAATGAGTAAGAT TATAAGGT T T GCCAT GTGAAAACCTAGAGCATAT TT TG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCAT TATT TGTTAAAGATAT T T TGCACT T AAT TT TGTGGGAAAAATATT GCTACAAT T T T T 
T T TAATC T C T GAAT GT AATT TCGATACT GTG T ACAT AGCAGGGAGTGAT CGGGGTGAAAT AACT T 
GGGCCAGAATAT TAT TAAACAATCAT CAGGC T T T T AAA 
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FIGURE 1Q2 



MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPLMFTOPSWYRWINNRLVATWLTLPVALLETMFGVKVI ITGDAFVPGERSVI IMNHRTRMDWM 
FL WNC LMRYS YLRLEK I CLKAS LKGVPGFGWAMQAAAY I F I HRKWKDDKS H FEDM I D Y FC D I HE P 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG 
LE I I ELACYRLLHKQPHLNSKKNE 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 
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FIGURE 10ft 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCC ATG GCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACC TC TACAGAGATGGGGAAGAC TGGGAAT CTAAGCAGAT GCCACAGTATCGAGGGAGAAC T GA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
u ,. GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC 
K AT TATAGTC CAGGAAAAT GCTGGGAGCATAT TGT GTT CCAT CCACCT T GCTGAGCAGAGT CAT GA 

Q GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
ff! TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
| \ AAATCCAAAGGGAAAAT CCAGGCGGAAC TGGACTGGAGAAGAAAGCACGGACAGGCAGAATT GAG 

™ AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
Z! T T TC TGAT C TGAAAAC T GTAACCCATAGAAAAGC T CC CCAGGAGGT GCCTCAC TC T GAGAAGAGA 

p TT TACAAGGAAGAGTGTGGT GGC T TC TCAGGGT T T CCAAGCAGGGAGACAT TACT GGGAGGT GGA 

M CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 

ggactatgagggtgggaccatctccttcttcaatacaaatgaccagtcccttatttataccctgc 
tgacatgtcagtttgaaggcttgttgagaccctatatccagcatgcgatgtatgacgaggaaaag 
C3 gggactcccatattcatatgtccagtgtcctggggatgagacagagaagaccctgcttaaagggc 
fij cccacaccacagacccagactvcagccaagggagagtgctcccgacaggtggccccagcttcctct 
ccggagcctgcgcacagagagtcacgccccccactctcctttagggagctgaggttcttctgccc 
5! tgagccctgcagcagcggcagtcacagcttccagatgaggggggattggcctgaccctgtgggag 
fM tcagaagccatggctgccctgaagtggggacggaatagactcacattaggtttagtttgtgaaaa 
ctccatccagctaagcgatcttgaacaagtcacaacctcccaggctcctcatttgctagtcacgg 
acagtgattcctgcctcacaggtgaagattaaagagacaacgaatgtgaatcatgcttgcaggtt 
tgagggcacagtgtttgctaatgatgtgtttttatattatacattttcccaccataaactctgtt 
tgcttattccacattaatttacttttctctataccaaatcacccatggaatagttattgaacacc 
tgctttgtgaggctcaaagaataaagaggaggtaggatttttcactgattctataagcccagcat 
tacctgataccaaaaccaggcaaagaaaacagaagaagaggaaggaaaactacaggtccatatcc 
ctcattaacacagacacaaaaattctaaataaaattttaacaaattaaactaaacaatatattta 
aagatgatatataactactcagtgtggtttgtcccacaaatgcagagttggtttaatatttaaat 
atcaaccagtgtaattcagcacattaataaagtaaaaaagaaaaccataaaaaaaaaaaaaaa 
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^FVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPL I S I VGYVDGGI QLLCLS SGWFPQPTAKWKGPQGQDLS SDSRANADGYSLYDVE I S I 
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEECRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 

P| TPIFICPVSWG 

Signal peptide: 

Y;i amino acids 1-17 

Transmembrane domains : 

amino acids 131-150, 235-259 



ill 

Ill 
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CCTT CACAGG ACT C TT CAT T GCTGGT TG GC A ATGA TGT AT CGGCC AGATGT GGTGAGGGCT AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT C ACT GTTCAT T ATG T GAG AT AT AAT C AAAAGAAGACCT ACAAT T AC T ATAGC AC AT TGTC AT TT AC AAC 
TG AC AAAC TAT ATGCT GAGT T T GGC AGAGAGGCT TC T AACAATTT T AC AG AAAT GAGCCAGAGACTTG AAT 
CAAT GGT GAAAAAT GCATTT TAT AAATCTC CATTAAGGGAAGAATT TGTC AAGT CT C AGGT T AT CAAGTT C 
AGTCAACAGAAGCATGGAGTGTTGGCTCATATGCTGTTGATTTGTAGATTTCACTCTACTGAGGATCCTGA 
AACTGT AGAT AAAATT GT TCAACT TGTT TT AC AT GAAAAGCT GC AAGATGCT GT AGGACCCCCT AAAGT AG 
AT CC TC AC TC AGT T AAAATT AAAAAAAT CAAC AAGACAGAAACAG ACAGCT ATCT AAACCATTGCT GCGGA 
ACACGAAGAAGT AAAACTCT AGGT CAGAGT CT C AGGAT CGTT GGT GGG AC AGAAGT AG AAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TTGT GAGT GCT GCT CACT GT TTT AC AAC AT AT AAGAAC CCTGCC AGAT GG AC TGCT TCCT TTGGAGT AAC A 
n AT AAAACC TTCGAAAATGAAAC GGGGTC TCCGGAGAAT AAT T GT CC AT GAAAAAT ACAAACACCCATC AC A 

f| TG AC TAT GAT ATTT CT CTTGC AGAGCT T TCT AGCCCT GTT CC CT AC AC AAAT GC AGTAC ATAGAGT TTGT C 

TCCC T GAT GC ATCCTATG AGTT TC AACC AG GT GATGT G AT GT TT GT GACAGGAT TT GGAGCACT GAAAAAT 
GATGGTT AC AGTCAAAATCATCT TCGAC AAGCAC AGGT GACT CT CAT AGACGCT AC AACT T GCAAT GAACC 
TC AAGCT T AC AATGACGCCAT AACTCCT AGAATGTT AT GT GC T GGC TC CT T AGAAGGAAAAACAGATGCAT 
GCCAGGGTGACTCTGGAGGACCACTGGTTAGTTCAGATGCTAGAGATATCTGGTACCTTGCTGGAATAGTG 
AGCTGGGGAGATGAATGTGCGAAACCCAACAAGCCTGGTGTTTATACTAGAGTTACGGCCTTGCGGGACTG 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG 

GGTGTGGAGGCC AT TT TT AGAGAT AC AGAAT T GGAGAAGACTTGC AAAAC AGCT AGAT TT GACT G ATCT CA 

if 

ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GAT C AACT CT GTC ATCTGTG AG CAAT AG TT GAAACT TT AT GT AC AT AGAG AAAT AGAT AAT ACAAT ATT AC 
AT T ACAGC CT GT AT TC ATTT GT TCTCT AGAAGTT T TGT CAGAAT TT TGACTT GT T GAC AT AAAT T T GT AAT 
GC AT AT AT ACAATT TG AAGC AC TCCTTT TCTT CAGTT C CT CAGCTC CT CT CAT TT CAGC AAAT ATC CATTT 
TC AAGGT GC AGAAC AAGGAGTG AAAG AAAAT ATAAG AAGAAAAAAAT C CCCT ACAT TT T ATT GGC ACAGAA 
AAGT AT T AGGTGT TTT TCTT AGTGGAAT AT T AGAAAT GAT C ATATT CATT AT GAAAGGT C AAGC AAAG AC A 
GC AG AAT AC C AATC ACT T CATC AT TTAGGAAGTATGGGAACT AAGT T AAGGAAGTCC AGAAAGAAGCC AAG 
AT AT AT CC T T AT TT TC ATTT CC AAACAACT ACTATG AT AAATGT GAAGAAG AT TCT GT TT T T TT GT GACCT 
AT AAT AAT T AT ACAAACT TCAT GCAATGTACT TGTTCT AAGC AAATTAAAGC AAAT ATTT ATTTAACATTG 
TT AC TGAGGAT G T C AAC ATAT AAC AAT AAAAT AT AAAT CACCCA 
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mYRPDVVRARKRVCWEPWVIGLVIFISLIVLAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHC FTTYKNPARWTAS FGVT IKPSKMKRGL 
RRIIVHEKYKHPSHDYDISIiAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DATT CNE PQAYNDAI T PPMLCAGSLE GKT DACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 

Transmembrane domain: 

amino acids 21-40 (type II) 
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AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GTTCT CACT T TAAC T GCAGAAACGAGGGT AGAGGAAGCAGTCAT T T T GAC T TAC T T TC CT 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GT GTAGAAC T GGC T TGTGGCGTT TGGACATAT GAACAGGAAC T TAT GGT TCCAGT ACAAT GGT CA 
GATATGGTCACT T TGAAAGCCAGGATGACAAAT TATGGAT TACC TAGATATCGGT GGC T TAC T CA 
H' T GC T TGGAAT TT T T TT CAGAGAGAGT T TAAGTGC T GT GGAGTAGTATAT T T CACT GACTGGT T GG 

Q AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA 
Q CAGGCCCACCAGGAAGATCTCAGTGACCTT TAT CAAGAGGGTTGTGGGAAGAAAATG.TAT TCCTT 

fjjft TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA 
"fl TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
*f± ACAGACCAAATGATGTCCT T GAAGAATGACAACT CTCAGCACC TGT CATGTCCCT CAGTAGAACT 

111 GT T GAAACCAAGCC T GT CAAGAAT CT T TGAACACACAT CCAT GGCAAACAGCT TT AATACACACT 

fp T TGAGAT GGAGGAGT TATAAAAAGAAAT GT CACAGAAGAAAACCACAAAC T TGTT T TAT T GGACT 

TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA 
TAACACCTAAGCAT AT AC TAT TC TATGC T T TAAAAT GAGGATGGAAAAGT T TCAT GT CATAAGT C 
JL ACCACC TGGACAATAAT T GAT GCC C TTAAAAT GC TGAAGACAGATGTCATACCCACT GT G T AGCC 

S TGTGTATGACTTTTACTGAACACAGTTATGTTTTGAGGCAGCATGGTTTGATTAGCATTTCCGCA 
If! TC CATGCAAACGAGTCACATATGGTGGGACT GGAGCCATAGTAAAGGT T GAT T TACT T C TACCAA 

r\ CTAGTATATAAAGTACTAAT T AAATGC TAACAT AGGAAGT TAGAAAATACTAAT AAC T T T TAT TA 

2^ CT CAGC GATC TAT T CT TCTGATGC TAAATAAAT TAT AT AT CAGAAAACT T T CAATAT TGGTGAC T 

:5f ACC TAAAT GT GAT T TT T GC T GGT TACTAAAATAT T CT TAC CACT TAAAAGAGCAAGC TAAC ACAT 

C3 TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
||| T T CAG T TCTGATAATGT TAAGAATAAC CAT TAT GAAAAGGAAAAT TTGT CCT GTAT AGCAT CAT T 

ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TATT TAAATACTTAACCACTAATTT TGAAAAT TACCAGTGTGATACATAGGAATCAT TATTC 
AGAATGTAGTCTGGTCTTTAGGAAGTAT TAATAAGAAAATT TGCACATAACT TAGT TGAT TCAGA 
AAGGACTTGTATGCTGTTTT TCTCCCAAATGAAGACT CTTT TTGACAC TAAACACTT TT TAAAAA 
GC T TAT CTTTGCCTTCTC CAAACAAGAAGCAAT AGTC T CCAAGTCAATATAAAT T CT ACAGAAAA 
T AGTGT TCTTTTTC T CCAGAAAAATGC T TGT GAGAATCAT T AAAACAT G T GACAATT TAGAGAT T 
CTTTGTTT TAT T T C AC T GAT T AAT AT AC T G T G G CAAAT T AC AC AGAT TAT TAAAT T T TTTTACAA 
GAGTATAGTATATTTATTTGAAATGGGAAAAGTGCATTTTACTGTATTTTGTGTATTTTGTTTAT 
T TCTCAGAATAT GGAAAGAAAATTAAAATGTGTCAATAAAT AT T T T CTAGAGAG T AA 



APP ID=10063561 Page 257 of 318 



FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLM 

VMIAVCCFL 1 1 VGMLGYCGTVKRNLLLLAWY FGS LLVI FCVE LACGVWT YE QELMVPVQWS DMVT 
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVRE FPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 



Signal peptide: 

|!SS 

f*l amino acids 1-33 

O 

Ifl Transmembrane domains: 

JJf amino acids 12-35, 57-86, 94-114, 226-248 

I! I; 



Q 
111 
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CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT C TT T CT C GT GAGAGC CT AG AGGC CT T AAAAAAAAAAGT GC T T GAAAGAGAAG G GGAC AAA GGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCTGGAGGAACACTGACCACCAGTTGGATGAGTCTCAAGGTCCTCCTCTATGT GACAACCATGT GAAT GGGGAG 
T GGT ACC AC TT C AC G GGC AT GGC GGGAGAT GCCATGCCTACCTTC T GCAT AC C AGAAAAC C AC T GT GG AAC CC A 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
*H CT G ACCAAGC C C AGC GT CT GCT T CC AC GT CT ACT GT GGT C AT TT T TAT GAC AT CT GC GAC GAGGAC T GC C AT GG 

tit CAGCT GC TCAG AT AC CAGC GAGT GC ACAT GC GC TCC AGGAACT GT GC T AGGCCCT GAC AG GC AGAC AT GC T T T G 

IZ AT G AAAAT GAAT GT GAGCAAAAC AAC GGT GGCT GC AGT G AGAT C T GT GT GAACCT C AAAAACT C CT AC C GCT GT 

T s I GAGT GT GGGGT T GGC C GT GT GC T AAGAAGT GAT GGCAAG ACT T GT GAAGACGT T GAAGGAT GC C ACAAT AAC AA 

1^; TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
ffl CTG AGGATAAC C AC ACT T GC C AAGT CCC T GT GT T GT GC AAAT C AAAT GCC AT T GAAGT GAAC AT CCCC AGGGAG 

CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
5 CCTCTTCTCTCTCAAGACATGTGGTACAGTGGTCGATGT GGT GAATGACAAGATTGTGGC CAGC AACCTCGTGA 

O CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCATCATCCGAACCAGCAAGCTGCTGATCCCGGTG 
|f| ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
Q CAT GAGC CGAAATCAT GGGATCT TCCCATTC ACTCT GGAGAT CTT CAAGGACAAT GAGTT TGAAGAGCCTT ACC 

?|J GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
Is! T T GGAAAGC T T GGT GGAGAGC T GCT T T GCCAC C C CCAC C T CC AAGAT C GAC GAGGT CCT GAAAT AC T AC C T C AT 

III CC GGGAT GGC T GT GT T T C AG AT GAC T C GGTAAAGC AGTAC AC AT C CC G GGAT C ACCT AGC AAAGC ACT T C CAGG 

TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
C AC AAAAT CAGAAGCT GGGT AT AATATT TCAAGTT ACAAACCCT AGAAAAATT AAACAGTT ACT GAAAT TAT GA 
CTT AAAT ACC C AAT G ACT C C T T AAAT AT GT AAAT TAT AGT TAT ACCT T GAAAT T T C AAT T C AAAT GC AGACT AA 
TTATAGGGAATTTGGAAGT GT AT C AAT AAAAC AGT AT AT AAT T TT 
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MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP 
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG 
LELFLTNTSCRGVSNGTHVNILFSLKTCGTWDVWDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPWHV 
SGLESLVESCFATPTSKIDEVLKYYLIRIX5CVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV 
L VCGVL DE RS RC AQGC HRRMRRGAGG E D S AGL QGQT LTGGPIRIDWED 



:: ;: 



Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 



Tyrosine kinase phosphorylation sites. 

Q amino acids 411-418, 443-451 



Isl N-myristoylation sites. 



amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 
522-528, 531-537 

Aspartic acid and asparagine hydroxylation site . 

amino acids. 197-209 



ZP domain proteins. 

amino acids 431-457 



Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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GAGAGAGGCAGCAGCT T GCT CAGC GG AC AAGG AT GC TGGGCGTG AGGG ACCAAGGCCT GCCCTGCACT CGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCT GCCT TGGGGT GACAAT CT CAGCTCC AGG CT AC AGGG AGACCGGGAG GAT C ACAGAGCC AGC ATG T 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AGAC CTTCAGAAAGGT GGGG AT CCCC AT C ATCAT AGC ACT ACTGAGCCTGGCGAGT ATC ATC ATT GT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GGAAGCAGCT GT GT GACGGAGAGCTGGACT GT CC CT TGGGGG AGGACG AGGAGC ACTGTGTC AAGAGCT T C 
CC CG AAGGGC CTGC AGT GGC AGTCCGCC TCTC C AAG GACCGAT C CACACTGC AGGT GCTGGACT CGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCT AC AGC AGAGCT GT GGAGAT TGGCCC AGACCAGGATCT GG AT GT T GTTGAAAT CAC AG AAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TCAGCATC CAGT AC GACAAAC AGCACGT CT GT GG AGGGAGCATCCT GG ACCCCC ACTGGGT CCT CACGGC A 
GCCCACTGCT T CAGGAAACAT ACCGATGT GT T CAACT GGAAGGT GCGG GCAGGCT C AG AC AAACTGGGC AG 
CT TC CCAT CGCT GGCT GT GGCC AAGATC AT CATC AT T G AATT CAACCC CATGTACCC C AAAGAC AATGAC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GATGTCT GACAT AC TGCTGCAGGCGT CAGT C CAGGT CAT T GACAGC AC ACGGTGCAAT GC AGACGATGCGT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CT GCGGGGGC CCGAGC ACCCCAGGAGT AT ACACCAAGGTCT C AGCCT AT C TC AACT GGAT CT AC AATGT CT 
GG AAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG ATCC CC C AAAGT CAGACAC AGAGC AAGAGT CCCC TT GGGTAC ACCC C T CT GCCC AC AGCCTC AGC AT 
TT CTTGGAGC AGCAAAGGGC CT C AATTC CT GT AAGAGACCCT CGCAGCCC AGAGGCGCCC AG AGGAAGT C A 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
C AGGGGT ATT GCTAAGCC AAGAAGGAACTTT C CC ACAC T ACT GAAT GG AAGC AGGC TGT C T T GT AAAAGC C 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT ACT AGAGC AAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
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MLQDPDS DQPLNS LDVKPLRKPRI PMET FRKVG I P 1 1 1 ALLSLAS 1 1 1 VWL I KVI LDKY YFLCG 

QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAYAVRLSKDRST.LQVLDSATGNWFSACFDN' 

FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 

KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL 

GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 

FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 

DQWHWG I VSWG YGCGGPS T PGVYTKVSAYLNW I YNVWPCAE L 

Transmembrane domain: 

amino acids 32-53 (typell) 
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GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTC^GCCTCCCAAGGTGCTGTGATTA 
TAGGTGTAAGCCACCGTGTCTGGCCTCTGAACAACTTTTTCAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTG T TGGGT TAT CACAAGGCATCGAG TC T CC T GCATTCAGTGGACATGTGGGGGAAGGGC T G 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGC TAATGGC T CAG TGT T GGCCCAGGAGGTCAGCAAGGCCTGAGAGC TGATCAGAAGGGCCT GC T 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
CTCAAT T TAAATCATGTTCTAGTAATTGGAGCTGT CCCCAAGACCAAAGGAGCTAGAGCTTGGT T 
CAAATGATCTCCAAGGGCCCT TATACCCCAGGAGACTT TGAT T TGAATTTGAAACCCCAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
CCAGC TACTCGGGAGGCT GAGACAGGAGAAT TACT T GAACCT GGGAGGT GAAGGAGGCT GAGACA 
GGAGAATCAC T TCAGCCT GAGCAACACAGCGAGACT CTGTCTCAGAAAAAATAAAAAAAGAATTA 
TGGT TAT T TGTAA 



APP ID=1 0063561 



Page 263 



FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPS PTVSRLEALTRAVQVAEPLGSCG FQGGPCPGRRRD 



Signal peptide: 

amino acids 1-15 



H 
Q 
Q 

m 
W 
if! 

Cfi 
u 

O 
111 

a 
ry 
a 
in 
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FIGURE lis 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATCGCAA 
AGAATCCT CCAGAGAAT TGTGAAGAC T G TCACAT TCTAAAT GCAGAAGC T T T TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTT TGGGGGAGCAAGCACT TC TGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAAT GGAGAGAAGAAGAAGAT T TACAT GGAAATT GATC CTGT GACCAGAAC T GAAATA 
T TCAGAAGCGGAAAT GGCACTGATGAAACAT TGGAAGTGCACGACTTTAAAAACGGATACACTGG 
y ; CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
Q CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAAGTTTCTTTGAACAGTCAGTG 
Q AT T TGGGT CCCAGCAGAAAAGCC TAT TGAAAACCGAGATT T T CT TAAAAAT TCCAAAAT T CT GGA 

U " GATT TGTGATAACGTGACCATGTATTGGATCAAT CCCACTCTAATATCAGTT TCTGAGTTACAAG 

I si 

fi AC T TTGAGGAGGAGGGAGAAGATC TT CACT T TCC TGCCAACGAAAAAAAAGGGAT T GAACAAAAT 

U jj 

ff| GAACAGTGGGTGG TCCCTCAAGTGAAAGTAGAGAAGAC C CG T CACGCCAGACAAGCAAGTGAGGA 

U AGAACTTCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAG 

* GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 

;f;f GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 

jjS* TAACT GGT GGGT GGCC CGCAT GC TGGGGAGGGT C TAA TAGGAGGT T TGAGCT CAAAT GC T TAAAC 

ffj TGCTGGCAACATATAATAAAT GCATGCTAT TCAAT GAAT TTCTGCCTATGAGGCATCTGGCCCCT 

Q GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 

ill T TAT C ACCAAAAAAAAAAAAAAAAAAA 
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FIGURE 116 



MAKNPPENCEDCHILNAEIAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKA.YDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
E FS EPEEE I DENEE I TT T FFE QS VI WVPAEKP I ENRDFLKNSKI LE ICDNVTMYW INPTL I S VSE 
LQDFEEEGEDLHFPANEKKG I EQNEQWWPQVKVEKTRHARQAS EEELP INDYTENG I E FDPMLD 
ERGYCC I YCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 {type II) 

N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCC GGCTGC ATCGCGGCC ACC GGGAT GGAC AT G TGGAGC AC CCAGGACCTGTACGAC AACCCCGT CACC T 
CCGT GT TC CAGT AC GAAGGGCT CT GGAGGAGCT GCGTG AGGC AG AGTT CAGGCT T C ACCG AATGCAGGCC C 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCCAACATGACACTGACCTCCGGGATC AT GT TC AT TGTCT C AGGTCT TT GT GC AATTGCTGGAGT G 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TCACACT AAT T GGGGGT GTGAT GATGTGCATC GCCT GC CGGGGCCT GGC ACC AGAAGAAACC AACT ACAAA 
GC CG TTT CTT ATCATGCCTCAGGC CACAGT GT TGCCT AC AAGCCTGGAGGCT TC AAGG CC AGC ACT GGCT T 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CT TCC AAGCACGAC T ATGTGTAAT GCT C T AAG ACCTCT CAGC ACGGGCGGAAGAAACT CCCG GAGAGCT C A 
CCCAAAAAAC AAGG AGATCCCATCTAGAT TT C TT CTTGCTT T TG ACT C AC AGCT GGAAGT T AGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTG AGTT AT T TAT GAATTAGAGGCT AT AGCT C ACAT T TT CAAT CC TCTATT T CT T TT TT T AAAT AT AAC T 
T T CT ACTCTGATGAGAGAAT GT GGTT TT AATCTCTCT C TC ACAT TT T GAT GATT T AGACAGACT CC CCCT C 
T T CC T C CT AGTC AATAAACC CATTGATG ATCT AT TT CCCAGCTT AT CCCC AAGAAAACTT TT GAAAGG AAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
AC ACTT ACTG AAGAAGAAGC AATAAG AG AAAGAT ATTT GT AATC TCTCC AGC CCAT GATCT C GGTT TTC T T 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA 
AAT AAAT AAT GT T T TAGT AAAATG AT AC ACT ATCTCT GTGAAAT AGCCTC AC CCCT AC ATGT GGAT AGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCT CAC ACCT GT AAT CCT AGC ACT T TGGGAGGCT GAGGAGG AAGGATC ACT T G AGCCCAGAAGT 
TCGAGACT AGCCTGGGCAAC AT GGAG AAGCCC TG TCT C TACAAAAT AC AG AGAGAAAAAAT C AGC C AGTC A 
TG GT GGCAT AC ACC TGT AGTCC C AGCAT TCCGGGAG GCTGAGGT GG GAGG AT CACT TG AGCCCAGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AAT AAAAAAT AAAT AATGGAAC AC AGCAAGT C CT AGGAAGT AGGTT AAAACT AATT CT TT AA 
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FIGURE 118 



MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFT ILGLPAMLQAVRALMI VG I VLGAI GLLVS I FALKC IRI GSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANMLVTN FWMS TANMYTGMGGMVQTVQTRYT FGAAL FVGWVAGGLT L I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKMKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

f*% amino acids 1-23 

yl Transmembrane domains: 

W amino acids 81-100, 121-141, 173-194 

tn 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGC T G GC T AAAAC T CAT CC C AGAG GAT AATGGC AAC C CAT G CC T TAGAAAT CG CT GGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTT GAAAAC T TCT GG GAAGGACTGT GGAT GAATT GCGT GAGGCAGGCTAA 
CATCAGGAT GCAGTGCAAAAT CTAT GAT TCCCT GCTGGCT CTT T CTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CATCACGGGCAT GGT GGTGCT CAT CCCT GTGAGCT GGGTT GCCAATGCCAT CATCAGAGATTTCTATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCT TCCCATCGCACAACCCAAAAAAGT TAT CACACCGGAAAGAAGTCACCGAGCG 
T CTACTCCAGAAGTCAGTATGTG TAG TT GTGTATGTT TTTT TAACTT TACTATAAAGCC AT GCAAATG 
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AAT CTTAAT TACAGGAACT GTGCAT CAGCTATTTATGATT CTATAAGCTATT T CAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCAT TTACTT CAAAAT GACATT GCTAAAGACT GCAT TATTTTACTACT GTAATTT CTCC 
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT 
TTTAT TTAAAAT GAAATGCCAGT CCATT ACACTGAATAAATAGAACT CAACTATT GCTTT T CAGGGAA 
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA 
TTTATAATGAAGATTAAAATGAAGGCTT TAAT CAGCATT GTAAAGGAAAT T GAAT GGCTT TCTGATAT 
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGT GTAT TAAATTAACAT TTT TAAAACGCAGATAT TTTGTCAAGGGGCTT TGCAT TCAAACT GCT T 
TT C CAGGGCTATACT CAGAAGAAAGATAAAAGTGT GATCT AAGAAAAAGTGATGGT TTTAGGAAAGTG 
AAAATAT T T T T GT TT T TGT AT TT GAAGAAGAAT GAT G CAT TTT GACAAGAAAT CAT AT AT G TAT G GAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACAT TT ATATTAATAAAT TGTACATTTT T CTAAT T 
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FIGURE 120 

MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
IYDSLIJyiiSPDLQAARGLMCAASVMS 

MWL I PVSWVANAI I RDFYNS I VNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY 
RYS I PSHRT TQKS YHTGKKS PSVYSRS QYV 

Signal peptide: 

amino acids 1-17 



u 
O 

rji Transmembrane domains: 

CO amino acids 82-101, 118-145, 164-188 

u 
m 
m 

.3 

C3 
111 
9 

ru 
■ru 
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FIGURE 121 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
|*| TACAAAGAT GCGT T CAAATAGTGC TC TAAGAGT T T TGT TCAGTGGCTCACT T CGGCTAAAAT GCA 

p! GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 

U! AT TGAAGC TATAAT T TATT TGGACCAAGGAAGCC CT GAAAT GAAT TCAACAAT TAATAT T CATCG 

P. t% 

!!f CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 

10 

5 TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
% AT TAT TGAAGAAC TACCAAA ATAA ATGC T T TAAT T T T CATT TGC TACC T C T T T TT T TAT TAT GCC 

Q T TGGAATGGTT CACT T AAAT GACATT T TAAATAAG TT TATGTATACAT CTGAATGAAAAGCAAAG 

Ml CTAAAT AT GT T TACAGAC CAAAG TGT GAT T T CACACT G T TT T TAAATC TAGCAT TAT T CAT T T TG 

2*A C T TCAATCAAAAGT GG T T TCAATATT T T TT T TAGT TGGT TAGAATACT T TC T TCATAGT CACAT T 

CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
f|| AAAAAATATAAAAGCT ACCAATC T T TGTACAAT T T GTAAAT GT TAAGAAT T T T T T T TATATCT GT 

TAAATAAAAATTATTTCCAACA 
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FIGURE 122 



MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWGTCSDYPKGDASTGWNSVSRIIIEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE- 12a 

GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT TGCCTGGAAGAATACAT CATGT T T T TCGATAAGAAGAAATTG TAGGAT C CAGT T T T T T T T T T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCT TTGAACAATACAAAGGATGGGTT TCAATG 

TAAT TAGGC TAC T GAGCGGATCAGC T G TAGCACT GGTT AT AGCC CCCAC TGT CT TACT GACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCC TTCAAAAACT TAAGTATAAT CAAT T TAAAGGGC TCAACCAGC T CACC TGGCTATAC 
CTTGACCATAACCATAT CAGCAATATTGACGAAAATGCTTT TAATGGAATACGCAGACTCAAAGA 
GCTGAT T CT TAGT T CCAAT AGAATCTCC TAT TT TCT TAACAATACC T T CAGAC CT GT GACAAAT T 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
C GGAAGC TGC T GAGT T TACATT TACGGTCTAACT CCCT GAGAACCAT CCC T GT GCGAAT AT T CCA 
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG 
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGTCCT GGACCT GGAGC TCC T T ACAAAGGCT TGAT T TAT CAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGC TCACAT T TAT T GG TCAAGAGATT T T GGAT TC TTGGATAT CCC T CAAT GACAT GAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG 
ATC GATGCAGT GAAGAACT ACAGCAT CT GTGGCAAAAGTACTACAGAGAGGT T TGAT CT GGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAAT TT TATGTA 
GAT TATAAACCCACCAACACGGAGACCAGCGAGATGC T GCT GAAT GGGACGGGACCC TGCACC TA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGT ATGAA CCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT 
GAACTCCGGTTT7^ATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT 
GT T T TAAGATAAAACT T CT T T CATAGGTAAAAAAAAAAA 



FIGURE 124 

MGFNVIRLLSGSAVALVI APTVLLTMLS SAERGCPKGCRCEGKMVYCES QKLQE IPSSI SAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFKTGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLS YNQLHS LGSE QFRGLRKLLS LHLRSNSLRT I PVR I FQDCRNLELLDLGYNRI RS 
I^OTFAGMIRLKELHLEHNQFSKLNIAL^ 

S GNE I EAFS GPSVFQCVPNLQRLNLDSNKLT FI GQE I LDSWI S LNDI S LAGN I WECSRNI CSLVN 
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDIiARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLV.ILLVIYVSWKRYPASMKQLQQR 

g sis 

SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 



%n 



Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 



Fi amino acids 126-129, 357-360, 496-499, 504-507 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 4 65-468 



Tyrosine kinase phosphorylation site. 

amino acids 136-142 



N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 12* 



CCGTTATCGTCTTGCGCTACTGCTG AATG TCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGT T GG GAGAC G G T GC AAGAGAAT CTGCCCCC TAT AGG G GAAT GG T GC G CAC AGCC C T AGG GAT C 
AT TGAAGAGGAAGGCTT TCTAAAGCTT TGGCAAGGAGTGACACCCGCCAT TTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGA2\ATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T TAGCCAATCCAAC TGACC TAGTGAAGGT T CAGAT GCAAAT GGAAGGAAAAAGGAAACT GGAAGG 

if"! 

It AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC 
ffl GAGGGC T T TGGGCAGGC TGGGTACCCAATATACAAAGAGCAGCAC TGGTGAATAT GGGAGATT TA 

hi AC CAC T TATGATACAGT GAAACACTAC TTGGTAT TGAATACACCAC TT GAGGACAAT AT CAT GAC 

|y TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
7 " TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
Q AT C TT GGC TGAGAAT GACCCCT T GGTCAATGGT GT TC T GGC T TAC TTAT GAAAAAAT CAGAGAGA 

Iff TGAGTGGAGT CAGT C CATT T TAA. 

i%$ 

I 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCARTVAEIiATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVWSGGRMVTYEHLREWFGKSEDEHYPLWKS 
VI GGMMAGVI GQFLANP TDLVKVQMQME GKRKLE GKPLRFRGVHHAFAK ILAEGG I RGLWAGWVP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKS S TDCL I QAVQGEG FMS LYKG FLPS WLRMTPWSMVFWL T YEKI REMSGVS P F 

Transmembrane domains: 

!U amino acids 25-38, 130-147, 233-248 

If! 



o 



m 
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FIGURE 127 



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
IZ GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
f| AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA 
ffl CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
Ui GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
:L" AGACACAAT TCGCC TT CAAAC TCCAC GACAAGGAC CC TGAGAAGGCAGC TGCCACCT AT GAGCAA 

f " ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
a CATCCAGAT T GGAGACATCCAGATGAGGCCCACGGAGCAGGT GACC TAGTGACTG TGGGAGC T CC 

O TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
VI CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA 
y TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
P5 TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
fll TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACT.TCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 

CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA 
AAAAAAAAAA 
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FIGURE 128 



MARPGMERWRDRLALVT GAS GG I GAAVARALVQQGLKOTGCART VGN I EELAAE CKSAGYPGTL I 
P YRCDL SNEED I LSMFSAI RSQHS GVDI C INNAGLARPDTLLS GS TSGWKDMFNVOTLALS I CTR 
EAYQSMKERNVDDGH 1 1 NI NSMS GHRVL PLS VTHFYSATKYAVTALTEGLRQELREAQTH IRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of* the protein: 
Signal peptide: 

Li, 

* amino acids 1-17 

ff! N-myristoylation sites. 

W amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 

yi 199-205 

^ Short-chain alcohol dehyrogenase . 

Q amino acids 30-42, 104-114 
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FIGURE 12Q 



•i-.r 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGT TGAGT GC T T GAAATGAGGAACTGAGAAAAT TAAT TT C T CATGTAT TT T TCT CAT T TAT TTA 
T TAAT TT T TAACT GATAGT TGTACATAT T T GGGGG TACAT G T GAT AT T T GGATACATGTATACAA 
TATATAATGAT CAAAT CAGGGTAACTGGGATAT CCAT CACAT CAAACAT TTAT TT T TT AT TC TT T 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAAC TAAT T T T TG TAT T T T TAGTAGAGACGGGGT T T TGCCATGT TGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGT AAGAAAAT GCAAT AT TTGTCTTTCTGT GCC T GGCT TAT T TCACT TAACATAAT GAC TTCCTG 
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG 
GCAAAAA 
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FIGURE 13Q 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 
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FIGURE lfti 



TT CTGAAGTAACGG AAGC TACCTTGT AT AAAGACCTCAAC ACT GCT GACCATGATCAGCGCAGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCT GTGTGT C GCT GCG AT GCGGGT TTCATT TACT GT AAT GAT CGCT TT C TGAC ATC C ATT CC AACAG 
GAAT ACC AGAGG AT GC T AC AACTCTCT ACCTT C AGAAC AACC AAAT AAAT AAT GCT GGGAT T CCTT CAGAT 
TT GAAAAACT T GCT GAAAGT AGAAAGAATAT ACCTAT ACC AC AACAGT TT AG ATGAAT TT CC TACC AACCT 
CCCAAAGT AT GT AAAAGAGT T ACATT T GCAAGAAAAT AAC AT AAGGACT ATC ACT T AT GAT T CACT TT CAA 
AAAT TCCC T ATCT GGAAG AATT AC AT TT AGAT GACAAC TCT GTC TCTGC AGT T AGC AT AG AAGAGGGAGC A 
■ . TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
f*% GCCCAGGACT AT AGAAGAAC T ACGCT T GGATGAT AATCGC AT AT CC ACT ATT TC AT CACC AT CT CTT C AAG 

Q GT CT CACT AGTCTAAAACGC CT GGTT CT AGAT GG AAAC CT GTTGAACAAT CATGGT TT AGGT GAC AAAGT T 

ffl TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
ty TCCAGGC ACAAACC TG AGGAAGCTTT AT CTTC AAGAT AAC CAC ATC AATCGGGT GCCCCC AAAT GC TT TT T 

Mi CTTATCTAAGGCAGCTCTATCGACTGGATATGTCCAATAATAACCTAAGTAATTTACCTCAGGGTATCTTT 

fill 

; :i ■- GATG ATTT GG AC AAT AT AAC AC AACT GATT CT TCGCAACAATCC CT GGT ATT GCGGGTGC AAGAT G AAAT G 

GGTACGTGACT GGT TAC AAT CACT ACCT GT GAAGGTCAACGTGC GT GGGCTC ATGT GCCAAGCCCC AG AAA 
^ AG GT TCGT GGGAT GGCT AT T AAGGAT CT CAAT GCAGAACT GTTT GAT TGT AAGGAC AGTGGGAT TGTAAGC 

?S AC CATTCAGAT AACCACTGC AATACCCAACAC AGTGTATCCTGC CC AAGGAC AGT GGCCAGC TCCAGT GAC 

i% CAAACAGC CAGAT ATT AAGAACCCCAAGCTCACT AAGGAT CAAC AAAC CAC AGGGAGT CCCT C AAGAAAAA 

m CAAT TACAAT T ACT GT GAAGTCTGTC AC CT CT GATACC AT T CAT AT CTCT TGGAAACT TGCT CT ACCT AT G 

f!J AC TG CTTT GAGACT C AGC T GGCTT AAAC TGGGCC ATAGCC CGGC AT T TGG AT CT AT AACAGAAACAAT TGT 

f!J AAC AGGGGAACGC AGT GAGT ACTTGGTC ACAGCC CT GGAGC CTGAT T C ACCCT AT AAAGT AT GC AT GGT T C 

CC AT GGAAACC AGC AACCT C TACCT ATT TGAT GAAACT CC T GTT TGT ATT GAGACT GAAACT GC ACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GCAGAAGC TGGCAC T AAG AAGGAC AACT CT AT CCTGGAAATC AGGGAAACTT CTT T TC AG AT GT TACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAAC AAT CAC AG T GAAAGC AGT AGT AAC CGAAGC TACAGAGAC AGT GG TAT T CC AG AC TCAGAT CAC 
TCAC ACTC ATGAT GCT GAAGGACTCACAGC AG ACTTGT GT TT TGGGT T T T TT AAAC CT AAGGGAGGTG AT G 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAGI PS DLKNLLKVERI YLYHNSLDE FPTNLPKYVKELHLQENNIRT I TYDSLSKI PYL 
EELHLDDNSVSAVS IEEGAFRDSNYLRLLFLSRNHLST I PWGLPRT IEELRLDDNRI ST ISSPSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLTOLTELSLVRNSLTAAPWLPGTNLRKLYLQDNHIN 
RVPPNAFSYLRQLYRLDMSNmLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTT GS PSRKT I T I TVKS VTSDT IHI SWKLALPMTALRLSWLKLGHS PAFGS I TET 
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE 
PYKNPNLPLAAI I GGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS 
ILE IRETS FQMLPI SNEPI SKEEFVIHT I FPPNGMNLYKNNHSES S SNRS YRDSGI PDSDHSHS 

Important features of the protein: 
Signal peptide: 

amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCGCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTC T GGGCT T GGCCAGAGCTCAGGGTGC T GAGCGTG TGACCAGCAGT GAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
j»t TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
Q C T CCAC TACC TCAAGC T GAGT GATCCCAAGTACCTAAGAGAG T TCCAGC TGACCCT CCAGCCCGG 

SH GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
|a AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
3 GACCACTCCAACAGAGGCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC 
C~| AGAGC TGAGGCCAT CGGATACGCC TACCC TACCCGGGACAT C T TCATGGAAAACAT CAT G TTCT G 

»^ TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
J! AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA 
#«* TATCAGCAGCAT T TTTCGAGGAGAGTGAAGAGGCGAGAAAAACAAT TTCCAGAT TCTCGCTCTGT 

flj TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GC T GAAAAGCAC T T T CAAAGAGAC TGTGTTGAATAAAGGGCCAAGGT TC T TGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GT GGC T TCCTATACAT CCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACAC CCAT CCACAC CG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
T T TAGTCCT CAT CC CT TAGAT CC TGGAGGGCACGGAT CACAT CCT GGGAAGAAGGCATC T GGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCA.GGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 134 

MSARGRWEGGGRRACRGSLGIARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD 
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAI QRSLHYLKLS DPKYLRE FQLTLQPG FWKLPHAW I HTDAS LVYPT FGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



|3 Important features of the protein: 

?M Signal peptide: 

;;:! amino acids 1-26 



Transmembrane domain: 

amino acids 39-56 

Tyrosine kinase phosphorylation sites. 

amino acids 149-156, 274-282 

N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 



Amidation site. 

amino acids 10-14 



Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE iar 

GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGC TTACAGATT TGATCCCGT TCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
U % CCACC T TCT TACT T TAT TAAAAGGGAATCGTGGGGCT GGACAGAC T TTC TAATGAAC CCAAT GG T 

p TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
Cfl CT GACAT GAGACGGGAAAT GGAGCAGTCAAT GAATAT GC TGAATTCCAACCAT GAGT TGCCT GAT 

^ GT TTCT GAGT TCATGACAAGAC TC T T C T C T TCAAAAT CATCTGGCAAAT C TAGCAGCGGCAGCAG 

m 

JJ: TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGG TAG TCAGGCCGTCCAGAGCTGGCATTTGCAC 
AAACACGGCAACACT GGGT GGCATCCAAGTC T TGGAAAACC GT GT GAAGCAAC TACTAT AAAC TT 
i GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

13 AAC TT T T TAGCACAT GT T T T GTAC T TGGTACACGAGAAAACCCAGC TT TCATC T TT TGT C T GTAT 

^5 GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTT^ATAAATTAT 
ATGAACTAC TAT ACAT TAT GTATAT TAAT TAAAACATC T T AAT CCAGAAATCAAAAAAAAAAAAA 
pi AAAAAAAAAAAAAAA 

?ta 

3 
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FIGURE 136 

MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGI.GDRFKIEGRAVVPGVKPQDWISAA 
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
WRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMM^LPLLIFVLLPKVWTSDPD 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 

Important features of the protein: 
Signal sequence: 

amino acids 1-23 

Transmembrane domain: 

amino acids 161-182 

N-glycosylation site. 

amino acids 184-187 

Glycosaminoglycan attachment sites. 

amino acids 37-40, 236-239 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 151-154 

N-myristoylation sites. 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A <P-loop) . 

amino acids 229-236 
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FIGURE 137 

GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACC TGCTGGGCACTAACGGCGGAGCCAGGAT GGGGACAGAATAAAGGAGCCACGACCT GT GC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGT GT GAT CGCCACAAACC T T CAGGAAA.T ACGAAATGGATT T TC TGAGAT AC GGGGCAGT G 
Q TGCAAGCCAAAGAT GGAAACATT GACATCAGAATCTTAAGGAGGAC TGAGTC T T TGGAAGACACA 

CO AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTAC CAGACCCCTGACCATTATAC TCTCCGGAAGAT CAGCAGCCT CGCCAATT CCT T T CT TA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
jy : AAGAAAT ACAGCCAGAT TCTGAGTCACT T TGAAAAGCT GGAACC T CAGGC AGCAGT T GT GAAGGC 

3 T TT GGGGGAAC TAGACAT T C T TC TGCAAT GGATGGAGGAGACAGA ATAG GAGGAAAGTGAT GCT G 

O CTGC TAAGAAT AT T CGAGGT CAAGAGCT CCAG TC T TCAAT ACCTGCAGAGGAGGCAT GACCCCAA 

J[; ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
!fj CTTCCT TGCATGAT TGTCTT TATGCATCCCCAAT CT TAATTGAGACCATACTTGTATAAGATTTT 

TGTAATATC T TT CT GC TATT GGATAT AT T TAT TAGT TAAT ATAT T TAT T TAT T T T T T GCTAT T TA 
III ATGTAT TTAT TT TT TTACTTGGACATGAAACT TTAAAAAAAT TCACAGATTATATT TATAACCTG 

ACTAGAGCAGG T GATGTATT T T TATACAGTAAAAAAAAAAAACCT T G TAAAT T CTAGAAGAGT GG 
CTAGGGGGG T TATT CATT TGTAT TCAAC TAAGGACATAT T TAC TCAT GC T GAT GCT CT GT GAGAT 
AT T TGAAAT TGAACCAAT GAC TAC T TAGGATGGGT T GT GGAATAAGT TT TGAT GTGGAAT T GCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAATCCTACACGGCCAGCAT GTAT T TC T ACAAAT AAAGT TT TCTT TGCAT AC CAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 

MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SI^SFLTIKKDLRLSHAmTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 13Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACC T C T C T TCAGTAAAGT TGT TAT TGT T C T GATAGATGCC T TGAGAGAT GAT TTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
T T TGTGGC T GAAGCAAAGC CACC TACAGT TAC TATGCCT CGAAT CAAGGCAT TGAT GACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGT C T T T TAT GGAGATGAAACC TGGGT TAAAT TA 
TTCCCAAAGCAT T T T GT GGAATAT GAT GGAACAACC T CATT T T TCGTGT CAGAT TACACAGAGGT 
H GGATAATAATGTCACGAGGCATT T GGATAAAGTATT AAAAAGAGGAGAT T GGGACATAT TAAT CC 

Q TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
Q AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
m GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
fl GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGG7WV 
%U CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
111 ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC 
J|| CAATGAGAGAGCAGTTGAGAT TTTTACATT TGAATACAGTGCAGC TTAGTAAACTGTTGCAAGAG 

y, AAT GT GCCGTCATAT GAAAAAGATCC T GGGT T TGAGCAGTT TAAAATGTCAGAAAGAT T GCAT GG 

GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG 
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
13 TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
Iff CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
ft- CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
if* GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
^ AGGGGGC C T CAG GGAAG GAC G T GGAG C AGC C T TAT C C C AGG C C T C T GGGT GT C C C GAC AC AGG T G 

IJ TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
f|| CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
TCGGACAGCC TCCCAGCAGAGGT GTGGGAGC T GCAGC T GAGGGAAGAAGAGACAAT CGGCCT GGA 
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC 
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATGCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
C T GACCGAGACAC T CACAGCT T TGTCAT CAGGGCACAGGCT T CCTC GGAGCCAGGAT GAT CT GT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
C T GCACACAG T AT GT AGT TACCAAAAGAATAAAC GGCAAT AAT TGAGAAAAAAAA 
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FIGURE mo 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWI VL I DALRDD FVFGSKGVKEMPYTT YLVEKGASHS FYAEAKP PT VTMPR I KALMTGS LPGF 
TOVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWWLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ 



ff! 



J.r, 



Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-76 



N-glycosylation sites. 

13 amino acids 56-60, 194-198 



N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 



Amidation site. 

amino acids 154-158 



Cell attachment sequence . 

amino acids 205-208 
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FIGURE 141 



GGCACGAGGCAAGCCT TCCAGGT TATCGTGACGCACCTT GAAAGTCTGAGAGC TACT GCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACC TTAAGACCACTCACACCT TCAGAGTGAAGAACTT AAAC 
C C GAAGAAAT T C AGCAT T CAT GAC C AGGAT CACAAAG TACT G G T C C T G GAC T C T G G GAAT C T CAT 
AGCAGT TCCAGAT AAAAACT ACATACGCCCAGAGATCT T CT T TGCAT TAGCCT CAT CC T T GAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TG TGACAAGGATAAAGGACAAAGTCAT CCATCCCT TCAGCT GAAGAAGGAGAAAC T GATGAAGCT 

O GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 

: «? ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GT T GGGGTGACAGATAAAT T T GAGAACAGGAAACACAT T GAAT T T TCAT T TCAACCAGT T TGCAA 

T?t AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 

|f| ATTTGAACTAATTGTATAAPAACACCAAACCTGCTCACT 

J* 

f.\ t 
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FIGURE 142 

MLLLLLE YNFP IENNCQHLKT THT FRVKNLNPKKFS I HDQDHKVLVLDS GNL I AVPDKNYIRPE I 
FFALAS S LS S AS AEKGS P I LLGVSKGEFCL YCDKDKGQSHPS LQLKKEKLMKLAAQKES ARRPFI 
FYRAQVGSWNMLESAAHPGWFI C TS CNCNE PVGVTDKFENRKH IE FS FQPVCKAEMS PSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 33-36 

N-myristoylation site. 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 



■Pi; 
I Hi 



o 
fll 
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FIGURE 143 

C TAGAGAGTAT AGGGCAGAAGGATGGCAGATGAGTGACT CCACATCCAGAGC T GCC TCCC TT TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAG TGGAGT T TGC T GT CCACACAT TCAACCAACAGAGCAAGGAC TAC TATGCC T ACAGAC TGGGG 
C AC AT C T T GAAT T C C T G G AAG GAGCAGG T G GAG T C C AAG AC T G TAT T C T C AAT G G AGC TAC T G C T 
Q GGGGAGAACTAGGTGTGGGAAATT TGAAGACGACATTGACAACTGCCATT TCCAAGAAAGCACAG 

Cfl AGC T GAACAATACT TT CACC TGC T TCT T CACCAT CAGCACCAGGCCCT GGATGACT CAGT TCAGC 

Pf CTCC TGAACAAGACCT GCTT GGAGGGATT CCACTGAGT GAAACCCAC T CACAGGCT T GT CCAT GT 

JL: GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 

M :! 

|\ TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 

5 - ACACAT C TT GAGCC TAAT CAT GTAG TGTAGAT CAT TAAACAT CAGCAT T T TAAGAAAAAAAAAAA 

C3 AAAAAAAAAAA&AAAAAAAAAAAftAAAAAAAAAAAAAAAAAAAAAAAAA^ 

U% ■ 
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FIGURE 144 

MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 
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FIGURE 



C TGTGCAGC T CGAGGC T CCAGAGGCACAC TCCAGAGAGAGCCAAGGT TC TGACGCG ATGA GGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CT GAGGCCCAGGT GGCT GAGAACCGCCCGGGAGCC TTCATCAAGCAAGGCCGCAAGC T C GACAT T 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCT T GCCAGGAGGC T GGCAGTACAGAGCGCAGCAGC GAGCAAATCCT GGCAAGT GACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT 
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GT GTAT TCCC GT T T TACAGATAAGGAAAC TGAGGCCCAGAGAGC T GAAG TAC TGCACCCAGCAT C 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
CT CTAATGAAAT T GT GAAAGC TCCAT G T TTAGAAATAAATGAAAACACC TGA 
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FIGURE 146 



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDI DFGAEGNRYYEANYWQFPDG IHYNGCS EANVTKEAFVTGC INATQAANQGE FQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLIALIWIMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-2 6 

Transmembrane domain: 

amino acids 157-171 



Zl N-glycosylation sites. 

y s 

l : amino acids 98-102, 110-114 

Tyrosine kinase phosphorylation site. 

.in 

amino acids 7 6-83 

Q 
'III 

#*i N-myristoylation sites. 

fll amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Amidation site. 

amino acids 62-66 



APP 



ID= 



10063561 
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FIGURE 147 

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATCATGCTGAGG 
TTTGGGGT 

u 

O 
CI! 
m 
lil 
in 
en 

13 
111 
O 

ry 

rll 
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FIGURE 148 



O 
□ 
Cf! 
m 

m 

If! 

o 

ill 
CI! 
Ill 
Q 

111 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide; 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 14Q 



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAAT CTCTACATAATAGAGAATTAAGT GCAGAAAGACCT TTGAATGAACAGAT TGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT 
T GT T GAT AACT T GAAC CT G CT AAAGGC AATAAC AGAAAAGGAAAAAAT T GAG AAAGAAAGACAAT CT A 
TAAGAAGCT CCCCACT TGATAATAAGT T GAAT GTGGAAGAT GTTGAT TCAACCAAGAATCGAAAACTG 
ATCGATGAT TAT GACT CTACTAAGAGTGGAT T GGATCATAAATTT CAAGATGATCCAGATGGTCTTCA 
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG 
AAAATGACAGAGCCGT GTT TGACAAGAT TGT T TCTAAACT ACTTAAT CTCGGCCT TATCACAGAAAGC 
CAAGCACAT ACACTGGAAGAT GAAGTAGCAGAGGT T T TACAAAAATT AATCTCAAAGGAAGCCAACAA 
TTAT GAGGAGGAT CCCAATAAGCCCACAAGC T GGACT GAGAATCAGG CTGGAAAAATACCAGAGAAAG 
T GACTCCAATGGCAGCAAT TCAAGATGGTCT T GCTAAGGGAGAAAACGATGAAACAGTAT CTAACACA 
T TAAC CT T GACAAAT G GCT TG GAAAGGAGAAC T AAAAC C T ACAGT GAAGACAAC T T T GAGGAAC T CCA 
ATAT T TCCCAAAT TTCTAT GCGCTACT GAAAAGTATT GAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA 
CAC T GAT TACTAT CAT GAAAACAC T GAT T GAC T T T GT GAAGAT GATG G T GAAATAT GGAACAATATC T 
CCAGAAGAAGGT GTTTCCTACCTTGAAAACT T GGATGAAATGAT TGC TCTTCAGACCAAAAACAAGCT 
AGAAAAAAAT G C T AC T GAC AATAT AAGCAAGC T T T TCC CAGCAC CAT CAGAG AAGAG T CAT GAAGAAA 
CAGACAGTACCAAGGAAGAAGCAGCTAAGAT GGAAAAGGAATAT GGAAGCTT GAAGGATT CCACAAAA 
GAT G AT AAC T C C AAC C C AG GAGGAAAG ACAG AT GAAC C C AAAG G AAAAAC AG AAGC CT AT T T GGAAG C 
CATCAGAAAAAAT AT TGAATGGTT GAAGAAACATGACAAAAAGGGAAATAAAGAAGATTAT GACCTTT 
CAAAGAT GAGAGACT T CAT CAAT AAACAAGC T GAT GC T TAT GT G GAGAAAGG CATC C T TGACAAG GAA 
GAAGC CGAGGCCATCAAGCGCAT TTATAGCAGCCT GTAAAAATGGCAAAAGAT CCAGGAGT CT T TCAA 
CTGTTTCAGAAAACATAATATAGCT TAAAACACTT CT AATT CT GT GATTAAAATTTT TTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC 
GTAAAAAC TAT C T GAAAGT AAAG T T GTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE 1*0 

MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGT PLT AED I VHKI AARI YEENDRAVFDKI VSKLLNLGL I TE SQAHTLEDE VAE 
VLQKLISKEAJSnSTYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDE PKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGI LDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 

N-myristoylation sites: 

amino acids 143-148, 239-244 
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FIGURE 151 



C GGCT CGAGGCT C CCGCCAGGAGAAAGGAACATT C T GAGGG GAGTC TACACCC TGTGGAGC TCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
^ TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
13 CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
O CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
VI GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
W.I GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
Iff TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
ff| GAAATGGCT CGAGC TCAGAAGATAAAAGATAAGT AGGGTAT GC TGATCC TC T T TT AAAAACCCAA 

1^ GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA 
AGACC TACT TACAAAGTGGCATATATTGCAAT TTAT TT TAATTAAAAGATACCTATTTATATAT T 
^ TCT TTAT AGAAAAAAGTCTGGAAGAGTT TACT TCAATT GTAGCAATGTCAGGGTGGT GGCAGTAT 

ClJ AGGTGAT TT TTCTT TT AAT TCTGT TAAT T TAT CT GT AT T T CCTAATT TT TCTACAAT GAAGATGA 

If! ATTCCTTGTATAAAAATAAGAAAAGAAAT TAATCT T GAGGTAAGCAGAGCAGACATCATCTCTGA 

f4 TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
|*f T TGTAGT AGT GAT CAGGAAACAGATC TCAGCAAAGC CACT GAGGAGGAGGC TGTGC T GAGT TT GT 

GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
13 GAT CACAGCCCCT GGGATT CCAAGGCAT TGGATCCAGT CT CT AAGAAGGC TGC TGTACTGG T TGA 

flJ ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTC TC TGCAGAT GT AGT T AGT TAAGACAAGGT CAT GCT GGAT GAAGGTAGACC TAAATT CAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC 
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG 
C TGAAACCT TAAT C TCAGACT TCCAGCC TCCT GAAC GAAGAAAGAATAAAT TT CGGC TGT T T TAA 
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT 
CTCCTCGTGTTTACAT TCTGT GTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGAT GGCAT GCCAC T T CCAAGAT T AGG T TATAAAAGACACT GCAGCT TC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACC TAGAAGCAGAGACTCTGTGAGATAATCGATGTTT GT TGTTT T 
AAGTTGC TCAGTTT TGGTCTAAC TTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 

MVLSGALCFRMKDSALKVLY 

QCLSCGVGQEPTLTLEPYNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE 15 3 

CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
T TCCAGCAGCCCTATAT CAC CAAC CGCACCTT CAT GCT GGC TAAGGAGGC TAGCT TGGC T GATAA 
CAACACAGACGT T CGT CTCAT TGGGGAGAAAC T GT T CCACGGAGT CAGTATGAGT GAGCGC T GCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGT GATGACC TGCATATCCAGAGGAATGTGCAAAAGC TGAAGGACACAGTGAAAAAGC 
T TGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATT TGCTGT TTATGTCTCTGAGAAAT 
GCCTGCATT TGA CCAGAGCAAAGC T GAA&AATGAATAACTAACCCCCT T TCCC TGCTAGAAATAA 
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGAT TC CAAAT GAACCCC TGCGT TAGT TACAAAGGAAAC CAAT GCCACT T T TG T T TATA 
AGACCAGAAGGTAGACT T T CT AAGCATAGATAT T TAT TGATAACAT T TCAT TGTAAC TGGTGTTC 
TATACACAGAAAACAATTTAT TT TT TAAATAATTGTCTT TT TCCATAAAAAA.GATTACTT TCCAT 
TCC TT TAGGGGAAAAAACCCC TAAATAGC T T CAT G TT T CCATAAT CAGTACT T TAT AT T TAT AAA 
TGTAT T TAT TAT TATTAT AAGAC TGCAT T T TATT TATAT CAT T TTAT TAATATGGAT T TAT T TAT 
AGAAACATCAT T CGAT AT TGCTAC T TGAGTG TAAGGCT AA.TAT T GAT AT T TATGACAATAATTAT 
AGAGCTATAACAT GTT TATT TGACCTCAATAAACACTTGGATATCCC 
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FIGURE 154 

MAALQKS VS S FMGTLAT SCLLLLALL VQGGAAAP I S SHCRLDKSNFQQPY I TNRT FMLAKE ASL 
ADNNTDWLIGEKLFHGVSMSERCYmKQVl.NFTLEEVLFPQSDRFQPYMQEWPFLARLSNRLS 
TCH I EGDDLH I QRNVQKLKDT VKKLGE S GE I KAI GELDLL FMS LRNAC I 

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

N-myristoylation sites. 

amino acids 14-20, 82-88 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21 
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FIGURE 15 5 

GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGAT T AGGTGAGGACAGT TCTC TCAT T AGCCT T T TCCTACAGGT GGT TGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCAGATGGACCCCCGGGGCAACTCGGA 
GC TGCT C TACCACAACCAGAC TGT C T TC TACAGGCGGCCAT GCCATGGCGAGAAGGGCACCCACA 
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
GTGATGGGC TAGC CGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
CAGGAT C CC GG GAC AGGAT GGGGGGCTTT GGGGAAAACC T GCAC T T C T G C AC AT T T T GAAAAGAG 
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCAC T T TCTAGATAT T T CCCC CT TGCT GGAGAAGAAAGAGC CCCT GGTT T TATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
T TGACAT GGAT GAT T CT GAGGAGGAAGC TGT TAT T GAAT GT ATAGAGAT T TATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 



MRERPRLGEDSSLISLFLQWAFLA^^VMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites, 
amino acids 44-50, 150-156 
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FIGURE 157 

CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT TGAGGGACC T CCGAGTAGAACC T GT TACAACTAGTGT TGCAACAGGGGACT AT T CA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GAT TTGTGTGACGGGCAAAAGCAACT TCCAGT CCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGC TGAACACAGT CTAT TT CAT TGGGGCCCATAATAT T CCTAAT GCAAATATGAAT GAAGAT GG 
IZ CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA 
7k AGT GTGT CAAGGCCGGAAGCC TGTGGGATCCGAACAT CACT GCTT G TAAGAAGAAT GAGGAGACA 

|fl GTAGAAGTGAACT T CACAACCAC T CCCC TGGGAAACAGATACATGGCT CT TATCCAACACAGCAC 

W TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
JJI TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
5 GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
Q CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
ITi TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
!f;; T CACACAAT T TGT TACT T CACTGAATT T C T T CAAAACCAT T GCAGAAGTGAGGTCAT CCT TGAAA 

IS AGT GGCAGAAAAAGAAAATAGCAGAGATGGGT CCAGT GCAGT GGC T TGCCAC T CAAAAGAAGGCA 

l?J 

fh GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 
GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATC TAAGAAGC CAGAT T CATC TGCACAAATACGT GGTGG TC TACT T TAGAGAGAT T GATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 
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FIGURE 158 



MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCV 

ST 1 1 GFSQVFEPHQKKQTRASWI PVTGDS EGAT VQLT PY FPTCGSDC IRHKGT WLCPQTGVP FPLDNNK 
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIBCKTSFSTTTLLPPIKVLWYPSEICFHHTICYFTEFL 
QNHCRSEVILEKWQKKKIAEMGPVQWIATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 
FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 
GCCSL 

Important features of the protein: 
Signal peptide: 
amino acids 1-14 

Transmembrane domain: 

amino acids 290-309 

# 

N-glycosylation sites. 

I amino acids 67 - 71, 103 - 107, 156 -160, 183 - 187, 197 - 201 and 283 

I - 287 

J 

cAMP- and cGMP -dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE 15Q 



m 

Q 



Q 

ru 



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCGCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCGGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
AC T TGGGC TGCAT CAAT GCT CAAGGAAAGGAAGACAT CT CCATGAATT CCGT TCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAGTAAGAGGTGCATATCC 
ACTCAGCTGAAGAAG 
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FIGURE 160 

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 



Signal sequence: 

amino acids 1-30 

f~v N-glycosylation site. 

fj amino acids 83-87 

m 



hi 
m 
m 

a 
in 
13 
rii 
a 
ru 



N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACAC T GGCCAAACAAAAACGAAAGCAC TCCGT GCT GGAAGTAGGAGGAGAGT CAGGACT CCCAGG 
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGGTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
H TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
O TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
Q GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
it. ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
fl GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
W CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
If! CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
Hi CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
y r GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
35 AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
S CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGG7\ACCCA 
|fl GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
*** CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
y CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
Q CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
pfj CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCGGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 

MPVPWFLLSLALGRSPWLSLERLVGPQDATHGSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS 
EEQH FGLSLY WNQVQGPPKPRWHKNLTG PQ 1 1 TLNHTDL VPCLC IQ VWPLE PDS VRTN ICPFRE DPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWElSr^VDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY I HKRWAL WLACLLFAAALSLILLLKPCDHAKGWLRLLKQDVRSGAAARG 
H RAALLL Y S AD DS G FE RLVGALASALCQL PLRVAVDLW S RRELSAQG P VAW F H AQ RRQT LQEGGVWLL FS P 

O GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

01 

Signal sequence: 

fB% amino acids 1-20 

V. '. 

s Transmembrane domain. 

Q amino acids 453-475 



N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP- dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCGATGAGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGT GGGACAGC GGGCCAGAGGGCACCCCAGACACGGT C TACAGCAT CGAGT ATA 
AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
T TC TACCACT TAGAGCT CCAGGTCAACC GCACCTAC CAAAT GCACC T TGGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
^•f CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC 
Q TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
H CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
ffii TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
i[l AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 
U] CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
0! GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
irt GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
* CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
* M CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 
O T T C TCTGCAGGAGGTGACC T C C T T GGCTATGGAGGAAT CCCAAGAAGCAAAATCAT T GCACCAGC 

111 CCC TGGGGAT T TGCACAGACAGAACATC TGACCCAAAT GTGCTACACAGT GGGGAGGAAGGGACA 

O CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
l!f TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
W GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
|'|| AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCGTTGAGAGAAGC 
AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAGT GCAAGGAGGAAATGCAGGGAAAC T CCCGAGGT CCAGAGCCCCACC T C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 
GCC TGGAAAAGAACCAGGCC T GGAAAAGAACCAGAAGGAGGC T GGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
T T CCCAGCCAGGGCAACT GCCTGACGT TGCAC GAT TT CAGC T TCAT TCC TCTGAT AGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCC T T T T C TGCAGGCAGGAGT T T CAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE i6zl 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS IQMIVHPTPTPIRAGDGHRLTLEDI FHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 

5; sr.; 

Q EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
O SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

rSi Signal sequence. 

m amino acids 1-17 

S..S :: 

s : 

a Transmembrane domain. 

?!! amino acids 233-250 



f)s N-glycosylation sites. 

|3 amino acids 80-83, 87-90, 172-175 

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAACATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGG T GAC GAT G TAT AT G T C TACTACAAAC T G G CAG G GG GAT C C C T T GAAC T T T GGGC TGGA 
AGT GT TGAACACAGT T T TGGATAT T TT CCAAAAGAT T T GATCAAGGTAC T T CATAAATACACGGA 
AGAAGAGCTACATATTCCAGCAGATGAGACAGACT TTGTCTGC TTTGAAGGAGGAAGAGATGATT 
T TAATAGTTATAAT GTAGAAGAGC TT T TAGGAT CT TTGGAAC TGGAGGACTCTGTACCT GAAGAG 
TC GAAGAAAGC TGAAGAAGTT TC TCAGCACAGAGAGAAATCTCCTGAGGAG T C T CGGGGGCGTGA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA 
AGACAGATGC T TACAAAGTCC TGAAAACAGAAAT GAGT CAGAGAGGAAGTGGACAGT GCGT TAT T 
CAT T AC AGC AAAG GAT T T C G T T GGC AT C AAAAT C T AAG T T T G T T T T ACAAAGAT T G T T T T TAGT A 

CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AAAAAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYWYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 



Important features of" the protein: 
Signal peptide: 

f\ amino acids 1-22 

N-glycosylation site. 

%l amino acids 294-298 

5 CAMP- and cGMP-dependent protein kinase phosphorylation site. 

I J amino acids 30-34 



H\ Tyrosine kinase phosphorylation site. 

iy 

f"\ amino acids 67-76 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCGTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCGAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG 
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GC AT AAAAGT AAAGGAAGAACAAAAAC TAG T CGAAAC CAG T C T AC AC T G T GGAT T T CAAAGAG T A 
CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT 
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC 
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
ACTTCAZ\ACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA 
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA 
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG 
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT 
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
AATAAG AAG C TAT T T CAT T AAG T G T G AT AT AAACC T C C T CAAAC AT T T T AC T T AGAG G C AAGGAT 
T GT C TAAT T T CAAT T G T GC AAGACAT G T GC C T T ATAAT TAT T T T T AGC T TAAAAT T AAAC AGAT T 
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAAf AAAAGAAG 
T GACAT AC ACAAT AT AAAT CAT AT G T C T T CACAC G T T GC C T AT AT AAT GAGAAG CAGC T C T C T GA 
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAAC TACGAAATCGTGT GAAAATGGGT TGG 
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVS FQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDEPSCGSEKCWMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK 
PYLTNQPGDTHQNVWTEAGIIPNLIYWIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ 
STLWISKSTRKESGMEV 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 

N-glycosylation sites . 

amino acids 8 6-89, 255-258 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 2 66-2 69 

N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 
145, 212-217 
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